FIGURE l 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGC AG G T C T G AAGAC T AAC AT T T T G T GAAG T T G T AAAAC AGAAAAC C T G T T AG AAATGT G G T GG T 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T C AT ACAT T AC TGC AGTAACAC T CCACCATATAGAC CCGGCT T T ACC T TAT AT C AGT GAC ACT GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
U CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
□ TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
fl ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 
\ II ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGT T TT T T CCT GAC T TACATT CGTGAT T T TCAGAAAATT TC T T TACGGGT GGAAGCCAAT T T ACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATAT T TGATGAAAGGATAAAATAT T T CT GTAATGAT TATGATT C TCAGGGAT TGGGGAAAGG 
T TCACAGAAGT TGC T TAT TCTTCTCT GAAAT T T TCAACCACT TAATCAAGGCTGACAGT AACAC T 
GATGAA.TGCTGATAATCAGGAAACATGAAAGAAGCCATT TGATAGATTAT TCTAAAGGATATCAT 
C AAGAAGAC TAT T AAAAAC AC C TAT GC C TAT AC T T T T T TAT C T CAGAAAAT AAAG T C AAAAG AC T 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMS FSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE ft 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTT CGCCATGAGT T TCCTCATC GACT CCAGCAT CATGAT TACCTCCCAGATACTATTT T TTG 
GATT TGGGT GGCTTT T CTT CATGCGCCAATTGT TTAAAGACTATGAGAT ACGT CAGTAT GT TGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGATGCT T TGGAAGAATTAAGCAGGCAGCTT T TT CT GGAAACAGCTGATCT 
ATAT GCTAGCAAGGAGAGAATAGAATACT CCAAAACCTT CAAGGGGAAATAT T TTAATT TTCTT GGT T 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCACCTTGAACTT AAGCCTACTACAGACTGT TAGAGGCCAGTGGTTT CAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTT TTCAC CTT CATAGCATACT CCTTCCCCGT CAGGTGATACTATGACCATGAGTAGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTAT GGTAGCT GAGCCAAACACGT AGGAT TTCCGT T TTAAGGT TCACAT GGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE A 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNS S SRY FHWKMNLCVI LL I LVFMVP FY I GYFI VSNI RLLHKQRLLFS CLLWLT EMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TWYLGIQFDTOFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP 

Important features : 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains : 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE a 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAAT TATGGGGATCAG C C TT GT GAGCAAAAAGGCGAACCAGCAGC T GAAT TT CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTT T GCAGCC TATTG T TACAAC TCAT CT GATACT TGGAC TAACT CGT GCAT TCCAGAA 
AT TAT C AC C AC C AAAG AT C C CAT AT T C AAC AC T C AAAC T G C AAC AC AAAC AAC AGAAT T TAT T G T 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
GAAAC TAGCAC CATGTC TACAGAAAC T GAACCAT T TGT T GAAAATAAAGCAGCAT TCAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAAT GAT CGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAAT GATAGCAACCC T AA 
TGAGGAAT CAAAGAAAAC T GATAAAAACCCAGAAGAGT CCAAGAGT CCAAGCAAAACTAC CGT GC 
GATGCC T GGAAGCT GAAGT T TAGATGAGACAGAAATGAGGAGACACACC TGAGGC TGGTTTCTTT 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCT TGGT TC CTAAC T GGAATCAGC T CAGGAC TGC CAT TGGAC TATGGAG T GCACC AAAG AGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACT GAGACCAAGGCT T TC T CTAC TGAT TCCGCAGCT CAGACCC T TT CT TCA 
GCTC TGAAAGAGAAACACGTAT CCCACCT GACATGT CCT TC TGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGT T TAGCCCC TGAAAGCCAT GGAGAT TC TCAT AAC TT GAGACCTAATC T CT GTAAA 
GC TAAAAT AAAGAAAT AGAACAAGGC T GAG GAT AC GACAG T ACAC TGT CAG C AGGGAC T G T AAAC 
ACAGACAGGGT CAAAGT GT TT TC TC TGAACACAT TGAGT TGGAATCAC T GTT TAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAAC TCTTATAAAT TTCTATTT TTATCTGAGT TACAGAAAT GATTACTAAGGAAGATT 
ACTCAGTAAT T TGT T TAAAAAGTAATAAAAT TCAACAAACAT T TGC TGAATAGC TACTATAT GTC 
AAGTGCT GTGCAAGGTAT T ACAC TC TGTAAT TGAATATT AT TCC TCAAAAAATTGCACATAG TAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TATTAACATACC TAAGAAGTACAT TGT TACCTC T AT AT AC C AAAG C AC AT T T TAAAAGT GCC 
ATTAACAAATGTATCAC TAGCCC TCC T T T T TCCAACAAGAAGGGAC TGAGAGATGCAGAAATAT T 
TGTGACAAAAAATTAAAGCATT TAGAAAACTT 
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FIGURE 6 

MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LS LAGKDQVETALKAS FE TCS YGWVGDGFWI SRI S PNPKCGKNGVGVL IWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 



WJ Transmembrane domain: 

amino acids 235-254 

fjl N-glycosylation site. 

ill . amino acids 53-57, 130-134, 289-293 



Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218. 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCT GCTA 
AAGCAT CATCAGAAGT GAACC TGGCAAACT T AC CTCCCAGCTAT CACAAT GAGACCAACACAGAC 
ACGAAGGTTGGAAAT7\ATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
TGGACAAAT GGTC T T TT CAGAGACAGT TATCACAT C T G T GGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
GGAGCAGT T T GCCACCC C T GT CCAGAT TAT TGGC TGCT T T GCC T CTAC CAG T TGGCAGACAGCCG 
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
C T AC TGT T AGGAACAGCAGTGT T CT CACAGTGTGGGGCAGCCGTCC T TCTAATGAAGACAAT GAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACC TGCAGAAACAGTAC T TAGGTAATTGTAGGGCGAGGAT TAT AAATGAAATT TGC 
AAAAT CACT TAGCAGCAAC TGAAGACAAT TATCAACCACGT GGAGAAAAT CAAACCGAGCAGGGC 
T GTGT GAAACATGGT TGTAATAT GCGAC T GCGAACACTGAACTC TACGCCACTCCACAAATGAT G 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATT GTAT AAGCAT GC TT TC TT TGAGT TT TAAAT TAT GTATAAACAT AAGT TGCAT T TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLIAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTEVGSRDQDGEILLPREVPDEYEVGSEMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 



L4 Signal sequence: 

W amino acids 1-19 

y s 

1,1 N-glycosylation site. 

\n amino acids 96-100, 106-110, 121-125, 204-208 

m 

w 

111 • •• • . • 

y< Casein kinase II phosphorylation site. 

% amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 

S 327-331 

f|j N-rnyristoylation site. 

O amino acids 202-208, 217-223 

\U 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
. GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAAGCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
gl GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
U1 GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
%. AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
F% CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
ffl TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
Q AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
111 GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATTT TATTAAAGATATTT TGTTAACTC 



til 
III 
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FIGURE to 



RTRGRTRGGCEKVP I NT S CNP TAHLVNS S CPGLMCVFQGYS SKGLI QRSVFNLQ I YGVLGL FWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQDI PT FPL I S AFI RTLRYHTGS LAFGAL I LTLVQ I ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IWVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features : 
1*4. Transmembrane domains: 

CI amino acids 57-80 (type II), 110-126, 215-231, 254-274 

; r > N-glycosylation sites. 

jj] amino acids 16-20, 27-31, 289-293 

ff| 

*&i Hypothetical YBR002c family proteins. 

'~ m amino acids 276-288 

rj Ammonium transporters proteins. 

ffj amino acids 204-231 

iti 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 



App_ID=l 0063565 



Page 162 of 320 



FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCA.TGGGGGCCTGCCTGGGA6CCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CI CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
ff| T T C T AC T T GCTG T CGAT CGC GG C C G T G GC GC T GAT G TT CAT GT AC T ACACT GAGC C CAG CGGC T GCC A 

y] CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
Ul CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
G GGCAAC GAGACAGT T GTG GCAG GC C C C GAGG GCTAT GAGACC CAGT GGTG GGAT G C CCCGAGC ATT G 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
j fl AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
Q GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ff| ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

■P ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 

f\\ 

s -* CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGL IIFLLCTLFISLRSS DHRQVNS LMQT EE C P PMLDAT QQQQQQVAACE GRAFDNE QDG VT Y S Y 
SFFHFCLVIASLHVMMTLTNWYKPGETRKMISTWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT T GGAAAAAGAC TCC T GTAACC CTCC T CCAGG ATGA ACCACCT GCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
C TCATGGCCAGGAT TGAGTCCTAT GAAGGAAGGGAAAAGAAAGGCATATC T GAT GT CAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCAT TGAGAACACAT TAGAGAAGGAGGT GATGCAGTAT GACT AC TATT CTT CATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
AC TGCGCCAT T GGT GGGCAATAGCGT T GACAACGGCAGT GACCAGTGCC T T T T TAC TAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
C TTGCCT GGAT TGAGACGT GGT TCCT GGAT T T CAAAGT GT TACC TCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACT T T TAGAAC T ATGA G TAC TACT T TT GT TAAAT GT GAAAAAC CC T CACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCTT TATTGAACAGCTAATAAAGAT TTATTTAT TGTAATACCTCACAAACGT TGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGAC T TAATATATTGAAGTAACACT T T T T TAGTAAGCAAGATACCTT T T TAT T TCAAT TCAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCC T TGT T T T T TAACT CATGCACAT GT GC TC T T T GTACAGT T T TAAAAAG TGTAATAAAAT CTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C TT GCAAAAGGGGAAGAAAGGAAT TGCGAATACATGTAAAAT GT CACCAGACAT T TGTAT TATT T 
T TAT CAT GAAAT CAT GT T T T T C TC TGAT TGTT C TGAAAT GT T C TAAAT ACT CT TATT TTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAAT TAAAAA 
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FIGURE 14. 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDYRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLIIAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYS PPE SE AGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide : 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



App_ID=10063565 



Page 166 of 320 



FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGTyiGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
H GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
Q TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
r% CGCCTGCGGCTGGCCGGCAACACCCG'CATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
7™ TGCCCTGCAGGAGCTGGATGTGAGCAACCTAA.GCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
'fl TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
Lfl CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
Ql CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
r« TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
w? CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
^. ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
C| GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
!j| GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
1^ AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
*!! GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
yi GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
Q GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
flf CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCAT.GGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGAGAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC TGT GTGAC T C TAGTC T TGGCCCCAGGAAGCGAAGGAACAAAAGAAAC T GGAAAGGAAGATGC 
T TTAGGAACATGT T TTGCTTTTT TAAAATATATATAT T TATAAGAGAT CC T T TCCCAT T T AT TCT 
GGGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATGAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRMjPPLRLPRLLLLDLSHNSLI^EPGILDTANVE 
ALRLAGLGLQQLDEGL FSRLRNLHDLDVS DNQLERVPPVI RGLRGLTRLRLAGNTR IAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARN^ 

trchfppknagrllleldyadfgcpattttatvpttrpwreptalssslaptwlsptapateap 
sppstapptvgpvpqpqdcppstclnggtchlgtrhhlaclcpegftglycesqmgqgtrpsptp 
vtprpprsltlgiepvsptslrvglqrylqgssvqlrslrltyrnlsgpdkrlvtlrlpaslaey 
tvtqlrpnatysvcvmplgpgrvpegeeacgeahtppavhsnhapvtqaregnlplliapalaav 
llaaij^vgaaycvrrgramaaaaqdkgqvgpgagplelegvkvplepgpkategggealpsgse 
cevplmgfpgpglqsplhakpyi 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATCCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAAT CCT TAGAT TCCAAGACTAG TTT GACAT CAGATGAGT CAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGT TGCTGGTCAAATAT TTCT TGAT TCAGAAGAATCTGA 
AT TAGAAT CCTCTAT TCAAGAAGAGGAAGACAGCC TCAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATAT CAGC TTT C TAGAGTC TCCAAAT CCAGAAAACAAGGACT AT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
, . T TTCCTAGATAAGGAGTAT GATGAAT GTACATCAGAT GGGAGGGAAGATGGCAGAC T GTGGT GT G 

f\ CTACAACCTAT GAC TACAAAGCAGAT GAAAAGT GGGGCT T TT GT GAAACT GAAGAAGAGGC TGC T 

Q AAGAGAC GGCAGATGCAGGAAGCAGAAAT GAT GTATCAAACT GGAATGAAAATCCT TAATGGAAG 

31 CAATAAGAAAAGCCAA?\AAAGAGAAGCATATCGGTATCTCC7\AAAGGCAGCAAGCATGAACCATA 

k ll CCAAAGCCCT GGAGAGAGT GTCATAT GC T CTT T TAT T T GGTGAT TAC T T GCCACAGAATAT CCAG 

Ml 

2« GCAGCGAGAGAGATGT T T GAGAAGC T GAC T GAGGAAGGC T CT CCCAAGGGACAGACT GCT CT TGG 

in CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 

B TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 

Q AATAAT AT TAACAT CAGAAGAATT TGTGG T T TATAGCGGCCACAAC T T T T T CAGC T T TCAT GATC 

tl CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 

f 1 

3* AACACATGGAATCTACAT GTAAATGAAAGT T GGTGGAGT CCACAAT T T T T C T T TAAAAT GATT AG 

Mi 

CJ TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATT^ATGGCTCTTTTTAAATTTTCTCTGAGTTG 
ill GAAT T GT CAGAATCAT T T T TTACAT TAGAT TATCATAAT T T TAAAAAT T T T TCT TTAGT T T T TCA 

AAAT T T T GTAAATGGTGGC TATAGAAAAACAACATGAAATAT TATACAATAT TTT GCAACAAT GC 
C CTAAGAAT T GTTAAAAT T CATGGAGTTATT TGTGCAGAATGAC TCCAGAGAGC T CTACTT T CT G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAACTCATTT TTAATAAAATTATGTCTAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA. 
AAAAAAAAAAA?\AAAAA?\AAAAAAAAAAA7U\AAAAAAAAAAA^^ 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEA 

E AYRYLQKAASMNHTKALERVS YALLFGDYL PQNI QAAREMFEKLTEEGS PKGQTALGFL YASGLGVN 
S SQAKALVY YT FGALGGNL I AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

yl Tyrosine kinase phosphorylation site. 

\X\ amino acids 220-228 

fli N-myristoylation sites. 

%l amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

iU 

rj Glycosaminoglycan attachment site. 

Ft J amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 



m 



Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AATTCAGATT TTAAGCCCAT TCTGCAGTGGAAT TTCATGAACTAGCAA.GAGGACACCATCT TCT T 
GTAT TATACAAGAAAGGAGT GTACC TATCACACACAGGGGGA&AAATGCT CT T T TGGGTGC TAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGT ACAT T T T TAT CAC TGGATGTGAC TCGGGCT T TGGAAACT T GGCAGCCAGAACT T T TGAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AAC CTCAGAGAGAC T TCGTAC TGTGC T TCTGGATGTGAC CGAC CCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGT GCT GGC T CCCAC T GAC T GGC T GACAC T AGAGGAC T ACAGAG AACC TAT T GAAG TGAA 
CCT GT TT GGACTCAT CAG T GT GACAC T AAATAT GCT T CCTTTGGTCAAGAAAGC T CAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCAT T GAACCAGGAT T GT T CAAAACAAACT TGGCAGAT CCAGTAAAGGTAAT TGAAAAAAAAC 
TCGCCAT T TGGGAGCAGC T GT C TCCAGACATCAAACAACAATATGGAGAAGGT TACAT T GAAAAA 
AG T C T AG AC AAAC T GAAAG G C AAT AAAT C C TAT G T G AAC AT GGAC CTCTCTCCGGT GG TAGAG T G 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFOTLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLA?VRTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPEWKRTAQWVKNQVGEKGLWGLINNAGVPGVIAPTDWLTLEDY 
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNIADPVKVIEKKIAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIP 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

\ N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 



N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATCGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCAT TACTGATT CCCAAATGGAT GAT GT TGAAGT T GT T TATACAATTGA 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAA.GAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCATTCAGATCAGATCATGACGT TTAGAGAGAGGCTGCTT CACAAAAAC TTGCAGGAGCATTT 
T TCAAACCAAGACC TTGT TTTTCTGCTATTAACACCAAGTATAATAACAGAAAGC TGCTCTACTC 
ATCGACT GGAACAT TC'CT TATATAAACCTCAAAAAGGACT TTT TCACAGGGTACC TT TAGTGGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
% TTT TAGCCGAGCAG TACAAACACACAGCT CT AAAT T T T T T GAAGAAGAT GGAT CCT TAAAGGAGG 

m TACATAAGAT AAAT GAAAT GTAT GC T TCAT T ACAAGAGGAAT TAAAGAGT ATAT GCAAAAAAGT G 

yj GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 

U1 GAAAA.GGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 

ffl 

ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 

III 

ATGTCT T T AAAAAATAGACAT GTTTC TAAAAG TAGC T GTAAC TACAACCACCATC TCGAT GTAGT 
p AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
[J] AAAT CAT TAAGCATAAAGCCTTAGAC TTAGAT GACAGAT GGCAATTCAAGAGAT CTCGGTTGT TA 

Q GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATT GAAAAGATGAAGGGT TTT GGTGAATAT TCAC GGT CTC 
*3i C TACATT T TGATCC T T T TAACC T TACAAGGAGAT T T TT TTAT T TGGC T GAT GGGTAAAGCCAAAC 

ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGT TTGCAGTAATACACAGATAACTCTTAGTGCATT TACT TCACAAAGTACTTTTTCAAACATCA 
GATGC TT T TAT T TCCAAACCT T T TT T TCACCT T TCAC TAAGTT GT TGAGGGGAAGGC TTACACAG 
ACACATT C T T TAGAAT T GGAAAAGT GAGACCAGGCACAGT GGC TCACACCT GTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT TGAT T GAAGC TAGGAGT TAGAGACCAGC CTGGGCAACGT AT T 
GAGACCATGT CTAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCC T TAT T T TCAAAftTATGGAAA 
GAAAT T T ATAT GAAAATT TATC TGAGT CAT TAAAAT T CT CC TTAAGTGATAC TT T T T TAGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGAIAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLT PS 1 1 TE S C S THRLEHSL YKPQKGLFHRVPLWANLGMSEQLGYKT VS GS CMS T G FSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
Q I QAAREKNI QKDPQEN I FLCQALRT FFPNSE FLHSCVMSLKNRHVS KS SCNYNHHLDWDNLTL 
MVEHTDI PEAS PAS TPQ 1 1 KHKALDLDDRWQFKRS RLLDTQDKRS KANTGS SNQDKASKMS S PET 
DEEIEKMKGFGEYSRSPTF 

Important features : 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site, 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGC CGCGCGGCGGAGGGCAGAGT CAGC CGAGCCGAGTC CAGC CGGACGAGCGGACCAG CGCAGGGCAGCC CAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCT CCACGAGATCAACGGTGATCATTTGAAGAT CTGTCCCCAGGGTT CTACCTGCTGCTCT 
js*s CAAGAGAT G GAGGAGAAGTACAGC CTGCAAAGTAAAGAT GATTT CAAAAGTGT GGTCAGC GAACAGT GCAAT CAT TT G 

ClJ CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
Q AAT GATATGTTT GT GAAGACATAT GGC CATTTATACAT GCAAAATT CT GAGCTATTTAAAGAT CT CTT CGTAGAGTT G 

Eft AAACGTTACTAC GT GGT GGGAAAT GT GAACCT GGAAGAAAT GCT AAAT GACT T CT GGGCT CGCCT CCT GGAG C GGAT G 

LI I TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCT GAAG 

U\ CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
if] ATGAT CTACT GCT CC CACTGCC GGGGT CT C GT GACTGT GAAGC CAT GTTACAACTACT GCT CAAACAT CAT GAGAGGC 

8 TGTT T GGC CAAC CAAGGG GATCT CGATT TT GAATGGAACAAT TT CATAGAT GCTATG CT GATGGT GGCAGAGAGGCTA 

\t»l GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
*H AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
Wi T CCAT CT CTGAAAGT GC CTT CAGTGCT CGCTT CAGAC CACAT CACCCC GAGGAACGCCCAACCACAGCAGCT G GCACT 

! U AGT TT GGACC GACTGGTTACT GATGTCAAGGAGAAACT GAAACAGGCCAAGAAATT CT GGTCCTCCCTTCCGAGCAAC 

fcf GTTT GCAACGATGAGAGGAT GGCTGCAG GAAAC GGCAATGAGGAT GACT GTT GGAATGGGAAAGGCAAAAGCAGGT AC 

\ S « CT GTTTGCAGT GACAG GAAAT GGAT TAGCCAACCAGGGCAACAACCCAGAG GT CCAGGTTGACACCAGCAAACCAGAC 

ATACT GATCCTTC GT CAAAT CAT GGCT CTT CGAGT GAT GACCAGCAAGAT GAAGAAT G CATACAAT GGGAAC GACGT G 
GACTT CT TTGATAT CAGT GAT GAAAGTAGT GGAGAAGGAAGT G GAAGT GGCT GTGAGTAT CAGCAGT GCCCTT CAGAG 
TT T GACTACAAT GCCACT GACCATGCT GGGAAGAGT GCCAAT GAGAAAGCC GACAGTGCT GGT GT CCGTC CT GGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 
AAAAAGT GTT CAT CAAAAAGTTAAAAGGCAC CAGTTAT CACTTTT CTACCAT CCTAGT GACTTTG CTTTTTAAAT GAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGAC TGT GCATT GAGTT GGT T CCT GCT CCCCCAAACCAT GTTAAACGT GGCTAACAGT GTAGGT ACAGAA 
CTATAGT TAGT T GT G CATTT GT GATTTTATCACTCTATTATTT GTTT GTATGTTTT TT TCTCATTT CGTTT GT GG GTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACC^GGGTCCCTTCTTGGCACGTAACATG 
TCT GAAATATTAAATAGCTGTACAGAAGCAGGT TT TATTT ATCAT GT TAT CT TATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 2d 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLBVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKP FGDVPRKLKLQVTRAFVAART FAQGLAVAGDWSKVSWNP TAQCTHALLKMI YCSHCRGL 
VTWPCYNYCSNIMRGCIJySIQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDW 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSOTCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPD I LI LRQIMALRVMT SKMKNAYNGNDVDFFD I S DES S GEGS GS GCE YQQCPSEFDYNAT DHAG 
KSANEKADSAGVRPGAQAYLL T VFC I L FLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites . 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C TGAGTCATCCCCAGGGAT CAGGAGCC TCCAGCAGGGAACCT T CCAT TATATT C T TCAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCT TCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAAT GT GAGTGCAAAGAT TGGTTCCTGAG 
AGCCCCGAGAAGAAAAT T CAT GACAGTG TC TGGGC T GCCAAAGAAGCAGT GCCCCTGT GAT CAT T 
TCAAGGGCAAT GTGAAGAAAACAAGACAC CAAAGGCACCACAGAAAGC CAAACAAGCATT CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTTAACCT TAAATGCAAT CAGGAAAGTAGCAAACAGAAGTCAATAAATAT TTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVLISSLLLLLPmmSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 

yj- 
\& 

ill. 



n 1 
■ ■■ 

i y 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGGACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
C TC T GTGCAGAGCCCAAGGAAGGTCC T GCAGGAGGTC CGGAGAG TAC T GAGACCGGGAGGT GT GC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCGTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCGTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGAC T CAAAGAGGCAGAGGTT T TGT TCT CAAATAT T T T T T AATAAATAGACGAA 
ACCACG 
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FIGURE 28 



m 
m 



ill 



MDILVPLLQLLVLLLTLPLHIJyiALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 

TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFYVAPGEDM 

RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVXRPGGVLFFWEHVAEPYGSW 

KHI GDGCCLTRETWKDLENAQFSE IQMERQPPPLKWLPVGPHIMGKAVKQS FPS SKALICS FPS L 

QLEQATHQPIYLPLRGT 

Important features : 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAA.CGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTT.CCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGC T GGGAGATAGGGAACAGAAGAGGG TAGT GGGT GGGC TAGGGGGGCT GCCT TAT T TAAA 
GTGGTTGTT TATGAT T C T TAT AC TAAT T T AT ACAAAG AT AT T AAG GC CC T GT T CAT T AAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGT TAAkAGC T GAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 3Q 



MLLLTLLLLLLLLKGSCLSWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

^ amino acids 3-18 

a 
a 
m 

LI! 

m 
m 

G 

m 
a 
m 
ci 
m 
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FIGURE ai 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 

; . CTGCTCGTCTT GAT T T T T GTT CT CAGAAAGAGAATAAAAT T GACAGT TGAGCT T T TCCAAAT CAC 

pi 

Q AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
Q TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
^ ; ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 

m 

f$z TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
iji TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
g GAT TCCGAGAAT CATT GT CAT GTACAT GCAAAACGCACTGAAAGAACAGCAGCATGGT GCAT T GT 

Q CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
j;!] AACCAGAATGCATATACTACAACTGC TATTAATGGGACAGATT TCTGTACAT CAGCAAAAGATGC 

X' AT TCAAAATCT T GT CCAAGAAC TCAAGTCACT T TACAT C TATTAACT GCT T T GGAGACT TCATAA 

CJ TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
fSj AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 

CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAAC TCCAGGCCAT T GTGAGATAGATACCCAT TT AGGT AT C TGTACCTGGAAAACAT T 
TCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAATTTTTTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLA^FTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTC Y FNRSKNDP PDH PILSSLSI L FFYHQGT WKGS FL I SWRI PRI I VMYMQNALKE QQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features : 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTT CGAT T AGC TCCT CT GAGAAGAAGAGAAAAGGT T CTT GGACC TC TCCCTGTTTCTTCCT TAGA 
ATAAT T T G TAT GGGAT T T G T GAT GCAG GAAAGC C T AAGGGAAAAAGAAT AT T CAT TCTGTGTGGT 
GAAAATT T T T T GAAAAAAAAAT TGCCT TCT TCAAACAAGGG TGTCAT TCTGATAT T TATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT TCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCC CAAG T TCAC T GTGCCTCAGAT CAACT GC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGT GGT GTGC T TGATAAT TCAGGAGGGAAAATAC T T GT TCGGAAGGT TGC TGGACAGT CT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGT CCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATT CCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
Q AGAGC TGATCCAGGTAT CCAAAGGCAAGAT CC T TCAGGAGCTGC CT T C CAGAAAC C T GT TGGAGC 

Q) GGAT GTCAGCC T GGGACT T GT TCCAAAAGAAGAATTGAGCACACAGTCT T TGGAGCCAGTAT CCC 

TGGGAGAT CCAAACT GCAAAAT TGAC T T GT CGT T TT T AAT T GAT GGGAGCACCAGCAT T GGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
CGGTCCAC T GAT GGGT GTT GTCCAGTAT GGAGACAACCCT GCTACTCACT T TAACCT CAAGACAC 
II ACACGAAT TCTCGAGATCTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACTT TCTAAT 

|1 GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GACT TGCGAGAGAGT CAGGAATCAACAT TT TC T TCAT CACCAT TGAAGGT GCTGC TGAAAATGAG 
j.| AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
|| GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
:1 ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
G TCCAAGCCCAACAAGAGGAAGT TAATGAT CC TCAT CACCGACGGGAGGT CC TACGACGACGT CC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 
AGCC TCGGAAC TGAAT TCAGAGCAGGCAGAGCACCAGCAAGTGC TGC T T TACTAACTGACGT GT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAAT GAGCCAAAAGGC TACATCAT GT TGAGGGTGC TGGAGATT TTACAT 
TTTGACAATTGTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T TGTCAT GACAATGTAGGAAT TGCT GAAT TAAAT GTT TAGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAZUIAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKI DLS FLIDGSTS IGKRRFRI QKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPOTWVMVDGWPTDKVEEASRLARE SGINI FFI T IEGAAENEKQYWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKS KPNKRKLMI L I TDGRS YDDVR I PAMAAHLKGVI TYAI GVAWAAQEELEVI ATHPARDHS FF 
VDE FDNLHQYVPR I IQNICTE FNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 3 5 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGAT GT CACT GAGATCCCTC AAAT GG AGC C TCCTGCTG CTGT CACTCC TGAGT T TC TT TGTGAT GTGGT AC 
CTC AGC CTT CCCC ACT AC AAT GTGAT AGAACGCGTGAACTGGAT GT ACTT CT AT GAGT AT GAGCCG AT T T A 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
TGGT GG GGAT AT GAGGT T CT T ACATT TTTCTTATT AGGCC AAGAGGCT GAAAAGGAAG AC AAAATGTT GGC 
ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA 
ACCT GAC C T TGAAAAC C ATT AT GGCAT T CAGGT GGGTAACT G AGTTTTGCCCCAAT GCC AAGT ACGTAAT G 
AAGAC AGAC ACT GATGTT TTCATC AATACT GGCAAT TT AGT G AAGTAT CT TTT AAACCT AAACC ACTCAGA 
GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 

CTT ACC AGGAGT ATCCTT TC AAGGTGT TCC CT CC AT ACTGC AGT GGGTTGGGT T AT AT AATGTCCAGAGAT 

W l 

I fi TTGGTGCCAAGGATCTAT GAAATGAT GGGT CACGT AAAACCC AT CAAGTT T GAAGATGTT T ATGTC GGGAT 

*sst 

CT GT TT GAATTT AT TAAAAGT GAACAT T CAT ATTCC AGAAGACAC AAATCTTTT CTTTCT AT AT AGAATCC 

S3 

A ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 

T GGC AGGTCATGCT AAGGAACACCAC AT GC CAT T AT TAACTT CACAT T CT ACAAAAAGCC TAGAAGGACAG 
GAT ACCTTGTGGAAAGTGTT AAAT AAAGTAGGT ACT GTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
I'M ACAC TGAACT GAAACT CATG AAAAACCC AGACTGGAGACTGGAGGGT T AC ACTT GT GATT TAT T AGTC AGG 

k - CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 

n \ 

]% * ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 

CT AGGC TGTAAAAAC AAAAC AAT GTAGAGT TT T AT T TAT TGAACAATGT AGT CACT T GAAGGT TT TGT GT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CT GAAC AAAATT TT ACCT GTTT T T GGTC AT TT AT AAAGT ACT TCAAGATGT TGCAGT ATT TCAC AGTT ATT 
ATTATTTAAAATTACTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAG 
TGAATC ATT C TT T AC ATGCAAAC ATT TT CC AGTT ACTT AACT GATCAGTT T ATT AT TG AT ACATCACT CC A 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT ATAGAGAAG AATT AAAGCAAG AAAATCT GAAAA 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYOTIERVNWM 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLl^HSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 3Z 



CGCTCGGGCACCAGCCGCGGCAAGGAT6GAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
T CGTCCT T GCCAAGAGAGTACAC AGTCAT TAATGAAGCC T GCCC TGGAGCAGAGTGGAATATCATGTGTCGGGAGTGC TGT G 
AATATGAT C AGATTGAGTGCGTC TGCCC C GGAAAGAGGGAAGTCGTGGGT TATACC ATC C C TTGC TGCAGGAATGAGGAGAA 
T GAGTGT GAC T C C TGCCTGAT CC ACCCAGGT TGTAC CATC TT TGAAAAC TGCAAGAGCT GC CGAAATGGCT CATGGGGGGGT 
ACC T TGGAT GAC TTC TATGTGAAGGGGT T CTAC TG TGCAGAGTGCCGAGCAGGCTGGTAC GGAGGAGACTGCATGC GATGTG 
GCC AGGT TC TGC GAGCCCCAAAGGGTC AGAT TT TGT TGGAAAGC TATCC CC TAAAT GC TCACTGTGAATGGAC CATTCATGC 
TAAACCTGGGTT TGTCATCC AAC T AAGAT TT GT CATGTTGAGTC TGGAGT T T GAC TACATGTGCCAGTATGACTATGT TGAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATC AT CAAGCGTGTC TGTGGCAACGAGCGGC CAGC TCC TATCCAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TAT AC TGGGCAGCGC T GTGAAAATC TCC T TGAAGAAAGAAACTGC TC AGAC C C TGGGGGCCCAGT CAATGGGTAC C AGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
T C T TAGTGGCAATGAGAAAAGAAC TTGCCAGCAGAATGGAGAGTGGTC AGGGAAAC AGCCC ATC TGCATAAAAGC C TGC C GA 
GAACCAAAGAT T TCAGACC TGGTGAGAAGGAGAGT T C TTC C GATGCAGG TTC AGTC AAGGGAGAC ACCATTAC AC CAGC TAT 
ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
C TGAGGAC TGGGAAGTGGAGT GGGCGGGCACCATC C TGCATCCC TATC TGC GGGAAAAT TGAGAACATCAC TGCTCCAAAGA 
CCCAAGGGT TGC GCTGGCCGT GGCAGGCAGC CATC T ACAGGAGGACCAGCGGGGT GC ATGACGGCAGC CTACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCAC CATGATC AAGACAGCAGAC CTGAAAGTTGT T T TGGGGAAATTC TAC CGGGAT GATGACCGGGATGAGAAGAC CATCC 
AGAGCCTAGAGAT TTC TGC TATCATTC TGCATCCCAACTATGACCCCAT CCTGCT TGATGC TGACATCGCCATCC TGAAGC T 
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
TCCCACATCACTGTGGC TGGC TGGAATGTCC TGGCAGACGT GAGGAGC C C TGGCTTC AAGAAC GACACACTGCGC TC TGGGG 
T GGTC AGT GTGGTGGAC TCGC TGC TGTGT GAGGAGC AGCATGAGGAC CAT GGCATCCCAGTGAGT GTCACTGATAACATGTT 
C TGTGCC AGC TGGGAAC CCAC TGCCCC T T CTGATAT CTGC ACTGC AGAGACAGGAGGCATCGC GGC TGTGTCC TT CC CGGGA 
C GAGC ATC TCCT GAGCC ACGC TGGCATC T GATGGGAC TGGTCAGC TGGAGC TATGATAAAACAT GCAGCCACAGGCT C TCCA 
C TGCC TTC ACCAAGGTGCTGC C T T TTAAAGACTGGAT TGAAAGAAAT AT GAAAT GAACC ATGC T C ATGCACTCCT TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCC AGGGCTT C TGAC TTCAGGGACAAAAC TCAGTGAAGGGTGAGTAGACC TCC AT TGCT GGTAGGC TGATGCCGC GTCCA 
C TAC TAGGACAGC CAAT TGGAAGATGCCAGGGC T TGCAAGAAGTAAGT T TC T TCAAAGAAGACCATATACAAAACC TCT C CA 
C TCCACTGACCT GGTGGTC T TCCCCAAC T TT CAGT TATAC GAATGCC ATCAGCT TGACC AGGGAAGATC TGGGCT TC AT GAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
T GTGT ACATGGC CACAGTAC AGTC TGGTCCT TT TC C TTCC C CATC TC T TGTACACATTT TAATAAAATAAGGGTTGGCT TCT 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLES YPLNAHCE WT I HAKPG FVI QLRFVMLS LE FD YMCQ YDYVEVRDGDNRDGQ 1 1 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLEMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVVVAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QSLQ I S AI I LHPNYDP ILLDAD I AI LKLLDKAR I S TRVQP I CLAAS R 
DLSTS FQESHI TVAGWNVLADVRS PGFKNDTLRSGWSWDSLLCEEQHEDHGI PVSVTDNMFCA 
SWEPTAPSDICTAETGGI7^VSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 2 60-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGT T C CTACAT C CT CT CAT CT GAGAAT CAGAGAGCATAAT CT T CT TAC GGGCCCGT GATT TAT TAAC GTGGCTTAAT C 
T GAAGGTT CT CAGT CAAAT T CT T T GTGATCTACT GAT T GT GGGGGCAT GGCAAGGT TT GCTTAAAGGAGCT T GGCT GG 
TT T GGGCC CT TGTAGCT GACAGAAGGT GGCCAGGGAGAAT GCAGCACACT GCT CGGAG AATGA AGGC GCTT CT GT T GC 
T GGTCT T GCCTT GGCT CAGT C CT GCTAACTACATTGACAAT GT GGGCAAC CT GCACTTCCT GTATTCAGAACT CT GT A 
AAGGT GCCTCCCACTAC GGCCT GACCAAAGATAGGAAGAGGCGCT CACAAGAT GGCTGTCCAGACG GCT GT GCGAGCC 
TCACAGCCAC GG CTCCCTCC CCAGAGGT TT CT GCAGCT GCCAC CAT CT CCTTAAT GACAGAC GAGC C TGGC CTAGACA 
ACC CT GCCTACGT GT CCTCGGCAGAGGACGGGCAGC CAGCAAT CAGC CCAGT GGACTCTGG C CGGAGCAAC CGAACTA 
GGGCACGGCCCTT T GAGAGAT C CACTAT TAGAAGCAGATCAT T TAAAAAAATAAAT CGAGCT TTGAGT GTT CTTC GAA 
GGACAAA.GAGCGGGAGT GCAGTT GCCAACCAT GCCGACCAGG GCAG GGAAAATTCT GAAAACACCACT GC CCCT GAAG 
TCTTT CCAAGGT TGTACCAC CT GATTC CAGAT GGT GAAATTACCAGCAT CAAGAT CAATC GAGTAGATCC CAGTGAAA 
GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAAC^CATTTATCGTGATGGGG 
TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAA^^ 

ACAACTACGCT GT GC GT CTCCT GCGGCAGCC CT GCCAGGT GCT GT GGCTGACT GT GAT GC GT GAACAGAAGTT CCGCA 
GCAGGAACAAT GGACAGGCC CC GGAT GCCTACAGACCCCGAGATGACAGCTT T CAT GT GAT T CTCAACAAAAGTAGCC 
C C GAGGAGCAGCTTGGAATAAAACT GGT GCGCAAGGT GGAT GAGC CTGGGGTTTT CAT CTT CAAT GT GCT GGATGGC G 
GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATC^ATGGACATGATCTTCGATATGGCA 
GCCCAGAAAGT GC GGCT CAT CT GATT CAGGC CAGT GAAAGACGT GT T CACCT C GT C GTGT CCC GCCAGGTT C GGCAGC 
GGAGC CCT GACAT CT TT CAGGAAGC CGGCT GGAACAGCAAT GGCAGCT GGT CC CCAGGGCCAGGGGAGAGGAGCAACA 
*j CTC CCAAGCC CCT CCAT CCT ACAATTACTT GT CAT GAGAAGGT GGTAAATAT C CAAAAAGACCCC GGT GAAT CTCTC G 

W GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 

TCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGATGGGGTCGAACTGACAGAGGTCAGCC 
■™* GGAGT GAGGCAGT GGCAT TATT GAAAAGAACAT CAT CCTC GATAGT ACTCAAA.GCT TT GGAAGTCAAAGAGTATGAGC 

* ■? f C C CAGGAAGACTGCAGCAGCC CAGCAGC CCT GGACTCCAAC CACAACATGGCCCCACCCAGTGACTGGT CC CCAT CCT 

;" ; r c 

W GGGTCAT GTGGCT GGAAT TACCAC GGTGCT T GTATAACT GTAAAGAT AT T GTATTACGAAGAAAC ACAGCT GGAAGT C 

lh T GGGCTT CTGCATTGTAGGAGGTTAT GAAGAATACAAT GGAAACAAAC CTTTTTT CAT CAAAT CCATT GTT GAAGGAA 

CAC CAGCATACAAT GAT GGAAGAAT TAGAT GT GGT GATAT T CTT CTTGCT GT CAAT GGTAGAAGTACAT CAGGAAT GA 
T ACAT GCTTGCTT GGCAAGACT GCT GAAAGAACTTAAAGGAAGAATTACT CTAACTATTGTTTCTT GGC C T GGCACTT 
TTTT ATAGA AT CAAT GATGGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTATATTTATC 
TTGTCAGTTTTTATAT TTAAAGAAAGAATACAT T GTAAAAATGT CAGGAAAAGTAT GAT CATCTAATGAAAGCCAGTT 
ACACCT CAGAAAATAT GATT CCAAAAAAATTAAAACTACTAGTTTTTT TT CAGT GT GGAGGATTT CTCATT ACT CTAC 
AACATTGTTTATATTTTTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCATTTTTAATTTACAGCT 
AAAATATTTTTTAAAAT GCAT TG CT GAGAAAC GTT GCTT T CAT CAAACAAGAAT AAATATT TTT CAGAAGTTAAA. 
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FIGURE do 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I SLMTDE PGLDNPAYVS SAEDGQPAI SPVDSGRSNRTRARPFERS T IRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLOTDGVELTEVSRSEAVALLKRTSSS^ 

SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features : 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE Ai 



ACCAGGCAT TGTATCT TCAGTTGTCAT CAAGT T CGCAAT CAGATT GGAAAAGCTCAACTT GAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATT ATT CACGT AATAAAAAACATGGGCTTCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTT.GGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCAT TTT GGGGAAGGGAAAAACT CTGACTAATGAAGCAT CCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTT GGAAGAGGTACAGGCAGAAAAT CCCAAAGTGTCCAGAGGCCGGTAT CGC CCTCAGGAAT GT AAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTT TAATCGAGCCAAACT CTT GAATGT GGGCTATCTAGAAGCCCT CAAGGAAGAAAATT GGG 
■Z\ ACTGCTT TATATT CCACGAT GTGGACCTGGTACCCGAGAATGACT TTAACCTT TACAAGTGTGAGGAG 

Q\ CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
y| GGGT GTTACTGCCCTAAGCAGAGAGCAGT TTT T CAAGGT GAAT GGATT C TCTAACAACTACTGGGGAT 

U I GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
3] CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
kH GATGAAGCTCTTACACCAAGTGT CACGAGTCT GGAGAACAGAT GGGTT GAGTAGTTGTTCT TATAAAT 

%, TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
• f < TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 

y i 

r*1 ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
f|| TAGCAGAGCTCCTGGTGAT GTAGAGTATAAAACAGT TGTAACAAGACAGCT T TCTTAGTCATTT T GAT 

CI CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
j SJ AATGAAC GCTATTTGAGGACTCT GGTTGAAGGAGAT T TAT TTAAATT TGAAGTAATATAT T ATGGGAT 

AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
ATTAACTAATAATAAATAT GT CTATCAAATACC TCT GTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EAS TKKVELDNCPSVSPYLRGQSKLI FKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAI LVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLIWGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GCT CAAGACCCAGCAGT GGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGAT C T GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAASAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

fi\ 

til 
111 



m 

Pi 
ill 
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FIGURE AA 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site* 

amino acids 1-12 
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FIGURE ax 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCT TCAACACAACCCC T CTTGTCACCATACAGCCAGAAGGGGGCAC TAT CATAGT GACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGAC TCAGGGATC T AC TATGT GGGGATATACAGCT CAT CAC T C CAGCAGCCCT CCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAAT GGCACC TGTGT GACCAAT C T GACAT GC TGCATGGAACATGGGGAAGAGGATGT GAT T 
M TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 

CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
f s t GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
Iji GTT T CTGAAGAGAGAGAGACAAGAAGAGT ACAT TGAAGAGAAGAAGAGAGT G GACAT T T GT CGGG 

fill AAACT CC TAACATATGCCCCCAT TC TGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 

^ f AGAACAAT CCTAAAGGAAGAT CCAGCAAATACGGT TT ACT CCAC TGT GGAAATACCGAAAAAGAT 

GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
rjij TCTAGACAGCAGTGCACTCCCC TAAG TCT C TGC TCA 



■us 



IT' 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD I CRET PNICPHSGENTE YDT I PHTNRT I LKEDPANTVYS T VE I PKKMENPHS LLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites* 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGAGCATGACCTGCTGCGAAGGATGGACATCCTGCAaTGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGT AAT GCCAAT T GT GAAT T TT CAT T GAAAAACAT CAGTGACAT T CATCCAGAATCC T T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTT TCCACTTCGAT TCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TGTAGTT TAATGGGAATAAAATGTAAGTATCAGTAGT TTGAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWT SCNGFS LLVLLLLGWLNAI PL I VS LVEEDQFSQNP ISC FEWWFPG 1 1 GAGLMAI PA 
TTMS LTARKRACCNNRTGMFLS S FFS VI T VI GALYGMLI S I QALLKGPLMCNS P SNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEE3SIKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 



App_ID=l 0063565 



Page 200 



FIGURE 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGT GAGGAACCT GCAGAGTCACACAGT TGC TGAC CAAT TGAGC T GT GAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATT T GCCAATAAAGACGAT CCC TT CTAC TATGACT GGAAAAACC TGCAGC T GAGCGGAC T GATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAZVGAG 
CAGCCAGAAGCAGCACAGT CC T GTACCT GAGAAGGCCAT CCCAC TCAT CAC TCCAGGC T CT GCCA 
CTACTTGC TGA GCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTIALLLIAGLTALEANDPFMTKDDPFYYDWKNLQLSGLICGGLIoAIAGIAAVLSGKCKYKS 
S QKQHSPVPEKAI PL I TPGSAT TC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myr i s toyl a ti on s i tes . 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE 51 

GTGGACTC TGAGAAGCCCAGGCAGT TGAGGACAGGAGAGAGAAGGC T GCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGTGGGAGGAAGACAC TCT GGAGAGAGAGGGGGC TGGGCAGAGATGAAGTT CCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGAT GCTT TGGGCAACAGGGTCGGGGAAGCAGCCCATGCTC TGGGAAACAC TGGGCACGAGA 
T TGGCAGACAGGCAGAAGATGT CAT T CGACACGGAGCAGATGCT GTCCGCGGCT CC TGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
G AAAC T C AG C AGGC AG C T T T G GAAT G AAT C C T C AGG GAG CTCCCTGGGGT C AAGGAG G C AAT G GA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGT GGCAGCAGT GG CAACAG T GG T GGCAGCAGAGGT GACAGCGGC AG T GAG T 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
C T TCAGAGGACAGGGAGT T TCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCT CCT TG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATT TTAAAT C CAAGCT GGGT T TCAT CAACTGGGAT GCCATAAACAAGGAC CAGAGAAGC T CTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



"At* 

hi 



if! 



MKFQGPIACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EAL GQGTREAVGT GVRQVPGFGAADALGNRVGEAAHALGNT GHE I GRQAEDVI RHGADAVRGS WQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNT QGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGS SNSGGGS GSQS GSSGS GSNGDNNNGS S S GGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 

amino acids 1-21 



m 

= || N-glycosylation site. 

0! amino acids 265-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-my r i s t oyl a t i on si te . 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE gg 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGGCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCT AT AT AACGAT C T T CAAC AAGAGT GCAAACAT CAT GC T T GACAAGT GGCAGC AC C T GG 
CCT CAGAGGGCAGCAGT CGT C TGGACAT GTT TGAGCACATCAGCCT CATGACCT TGGACAGTCTA 
CAGAAAT GCAT C T TCAGCT T T GACAGCCATT GTCAGGAGAGGCCCAG T GAAT ATAT T GCCACCAT 
CT T GGAGCT CAGT GCCCT TGTAGAGAAAAGAAGCCAGCAT AT CCT CCAGCACATGGACTT TCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGAC TT T GGAT T T CAT T GAT GTGC T T C T GCT GAGCAAGGATGAAGATGGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTT CATGTT TGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 



FIGURE z& 



MSLLSLPWLGLRPVMSPWLLLLLWGSWLI^ 

L I TPTEEGLKDS TQMS AT YS QGFT WLGP 1 1 PFI VLCHPDT IRS I TNAS AAI APKDNLFIRFLKP 
WLGEGI LLSGGDKWSRHRRMLTPAFHFNI LKSYI T I FNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
U LAFIPFSAGPRNCIGQAFAMAEMKWIAIjMLLHFRFLPDHTEPRRKLELIMRAEGGLWL^ 

W vglq 

'"Alt, 

m 

I . i Important features : 

jji Transmembrane domains : 

■W amino acids 13-32 (type II), 77-102 

m 

Cytochrome P450 cysteine heme-iron ligand signature. 

' m amino acids 461^471 

M ■ 

N-glycosylation sites. 

i= " amino acids 112-116, 168-172 

ill 
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FIGURE 55 



ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
T T T GGT GGC ATAACAAGGGAC T T GCACTTAT C T TC TGCAT T T TGC AG T CT T T GGCAT T GAC GT GG 
TAGAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGAT T CGAACAT T TGAGGGT TAC T T T T GGAAGCAACAATACATTC T CGAACCTGAATGT CAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T CT GGAT GT T GTCCCACT GAATTCCCAT GAATACAAACCT AT TCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 56 

MGPVKQLKRMFE PTRL I AT IMVLLC FALTLCSAFWWHNKGLAL I FC ILQS LALTWYSLS FI PFAR 
DAVKKC FAVCLA 



Important features: 
Signal peptide: 

amino acids 1-33 



i^i Type II fibronectin collagen-binding domain protein* 

Q amino acids 30-72 

m 

UJ 

ill 
m 
m 

s 

o 

ill 

a 
ni 
a 
in 
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FIGURE 5Z 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTT CAAG CTCAGT GTCTT CAT CCCCT C CCAGGAATT CTCCACCTAC CGCCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGAC CTT GAT G GGCAGCTAGACTT T GAAGAATTT GT CCATT ATCT C CAAGAT CAT GAGAAGAAG CT GAGG 
CT GGT GTTTAAGATTTT GGACAAAAAGAAT GAT G GACG CATT GAC GCGCAGGAGATCAT GCAGT C CCT GCGGGACTT G 
GGAGT CAAGAT AT CT GAACAGCAGGCAGAAAAAAT T CT CAAGAGCAT GGATAAAAAC GGCACGAT GACC ATC GACTGG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGT GGAAAACATCCCCGAGATCATCCTCTACT GGAAGCATT CCACG 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT GCCT CCCGCAGCAACAACAT GGG CAT C GTT GGT G GCTTCACT CAGATGAT T CGAGAAGGAGGGGC CAGG 
T CACT CT GG CGG GGCAAT GGCAT CAACGTCCT CAAAATT GC CC CC GAATCAGCCAT CAAAT T CAT GGC CTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
( . AT C GC CCAGAGCAGCAT CTACC CAATGGAGGTCCT GAAGACC C GGATGGC GCT GCGGAAGACAGGCCAGTACT CAGGA 

AT GCT GGACT GCGCCAGGAGGAT CCTGGCCAGAGAGGG GGT GGCC GCCTT CTACAAAGGCTAT GTC CCCAACAT G CT G 
f*\ GGCAT CAT CCC CT AT GC CGGCATCGAC CTT GCAGT CTACGAGACGCT CAAGAATGC CT GG CT GCAGCACTAT GCAGT G 

JE$ AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
M CCC CT G GCCCTAGTCAGGAC CCGGAT GCAGGCGCAAGCCT CTATT GAGGG C GCTC CGGAGGT GACCAT GAGCAGCCT C 

ft | TTCAAAC^TATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCC^ 
f'l GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
LI! AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
if] TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
* ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
Q T GT GAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACT G GCAT CT GAGCC CTGCT GAT GGCTGGGGCT CTCGG GCAT G 

r=f| CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 

G GCCT GGACCCTGT CAGGAT GGGC CCCAC CT CAGAACCAAACT CACTGTC CCCACTGT GGCATGAGGGCAGT GGAGCA 
Q C CAT GTT T GAGGGCGAAG GGCAGAGCGT TT GT GT GTTCT GGGGAGGGAAGGAAAAGGT GTT GGAG GCCTT AATTAT GG 

m ACT GT T GGGAAAAGGGTTTT GTCCAGAAG GACAAGC CGGACAAATGAGC GACTT CTGTGCT T CCAGAGGAAGAC GAGG 

il }J GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
Q GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
f{j AACTATT TTTATAGAT TTGTT TAATT AATAGCTT GT CATTTTCAAGTT CATTT TTTATTCATAT TTAT GTT CAT GGTT 

f 5;: GATT GTAC CTT CC CAAGCCC GCCCAGTG GGATGGGAGGAGGAG GAGAAGG GGGGC CTT GGGCC GCT GCAGT CACAT CT 

GTCCAGAGAAATT CCTTTT GGGACT GGAGGCAGAAAAGC GGCCAGAAGGCAGCAGCCCT GGCT CCTTT C CTTT GGCAG 
GTTGG GGAAGGGC TT GCCC CCAGCCTTAGGATT T CAGGGT TTGACT GGGGGCGT GGAGAGAGAGGGAGGAACCT CAAT 
AACCTT GAAGGTGGAATCCAGTTATTTCCT GCGCT GCGAGGGT TT CTTTAT TT CACTCT TTTCT GAAT GTCAAG GCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CT TCAATAGGATGCAAAGAT CAAT GCAAAAATTGTT ATATAT GAACATATAACT GGAGT CGT CAAAAAG 
CAAATTAAGAAAGAAT TGGAC GTTAGAAGTT GT CATTTAAAGCAGCCTT CTAAT AAAGTTGTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PEI ILYWKHST I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVmQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSIAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGYVPNMLGIIPYAGIDIAWETLKNAW 

GQLAS YPLALVRTRMQAQAS IEGAPEVTMS S LFKH I LRTE GAFGLYRGLAPNFMKVT PAVS I S YV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
T T CCCT GGGGCAGATCC T CT TC TGGAGCATAAT TAGCAT CAT CATTAT TC TGGC TGGAGCAAT T G 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT T GGGGAGGAT GGAATCC T GAGCT GCAC T T TT GAACCT GACAT CAAACT T T CT GATAT 
CGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGC T T T GAGC TGAACT CT GAGAAT GT GACCAT GAAGGT T GTGTC TGTGC T CTACAATGT TACGA 
T CAACAACACATAC TCC TGTATGAT TGAAAAT GACAT TGCCAAAGCAACAGGGGAT AT CAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
GTGCCTTGGCCACAAAAAAGCATGCAAAGT CAT TGTTACAACAGGGAT CTACAGAACTATTTCAC 
CACCAGATATGACCTAGT TTTATATT TCTGGGAGGAAATGAAT TCATATCTAGAAGTCTGGAGTG 
AGCAAAC AAGAGCAAGAAACAAAAAGAAG C CAAAAGCAG AAGGC T C C AATATGAACAAGAT AAAT 
C T ATC T T CAAAGACATATT AGAAGT T GGGAAAATAAT TCAT GT GAAC TAGACAAGTGT GTT AAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACC T GGGGAGT GAGAGGACAGGATAGTGCATGT TCT TTGT CT C TGAATT TT TAGT TATAT GT GC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTTCCCAACTGACAAATGCCAAAGT TGAGAAAAATGAT CATAAT TT TAGCATAAACAGAGCAGT 
CGGGGACACCGAT T T TAT AAAT AAAC T GAGCAC C T TC TT T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
D I VI QWLKE GVLGLVHE FKE GKDE L S EQDEMFRGRTAVFADQV I VGNAS LRLKNVQLT DAGT YKC 
Y 1 1 TSKGKGNANLEYKTGAFSMPEVNVDYNAS SETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTIN^^ 
CVSSFFAISWALLPLSPYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 6l 



TGACGTCAGAATCACCATCGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGAC T CAGATCACAGT GGCTATATCT CCAT GAAGGAGC T AAAGCAGGCCCT GGTCAACTGCA 
AT TGGT CT TCAT TCAATGATGAGACC T GCCT CATGAT GATAAACAT GT T T GACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCT TCTCGGAT GCT ATGAC CCAACCATCT GTGGAGAGTGGAGT GCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCT TGGATT CTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTC TCCAGGAGACAGT GGTCACC T CT CCCT GCCAATACT T TT TT TAAT T T GCATT T T T T T TC 
AT T TGGGGC CAAAAGTCCAG TGAAAT T GTAAGCT T CAATAAAAGGAT GAAAC T C TGA 
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FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALTOCNWSSFNDETCLMMINMFDKTKSGRIDVYGFSALWKFIQQW 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNI RLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATGGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGAGTGGTTGGGGAACGAGGCCCATGCCAGACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGC TGGGGAAATAT T AAT CGGTAT CCAGGAGGCAGC T GGGGGAATAT TAAT CGG TATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGGCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
T GCTCATCTCCAATAAAATAAAAGCACT TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAMAAAAAAAAMAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 6d 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMD^ 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 



AppJD=10063565 



Page 216 of 320 



FIGURE 6s 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCT CAACACCAT GT T GTC TGCAACACATGACAGCCAT T GAAGCC TGT GT CCT T CT T GGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
C TGAGT GGCAATAAATAAAAT TC GG TAT GGTG 
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FIGURE 66 

mgsglplvllltllgsshgtgp'gmtlqlklkesfltnssyessflelleklclllhlpsgtsvtl" 
hharsqhhwcnt 



Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site, 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 
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ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCC6GAGATGGGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG 
TAG T GAGAT GTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 
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MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLWKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 
amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites . 

amino acids 1-13, 26-32 
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FIGURE 6q 



GC CAGGAATAACTAGAGAGGAACAATGGGGT TATT CAGAGGTT TT GTT TT CCT CTTAGTT CT GTGCCT GCT GCACCAG 
TCAAATACTT CCTT CATTAAGCT GAATAATAAT GGCTTTGAAGATATT GTCAT T GTTATAGAT CC TAGTGT GC CAGAA 
GAT GAAAAAATAAT TGAACAAATAGAGGATAT GGT GACTACAGCTT CTACGTACCT GTTT GAAGC CACAGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTC^GTACAAAAGGCCAAAACATGAA 
AACCATAAACAT GCT GATGTT ATAGTT GCACCACCTACACT CCCAGGTAGAGATGAACCATACAC CAAGCAGT T CACA 
GAAT GTGGAGAGAAAGGCGAATACATT CACTT CAC C C C TGACCTT CTACT T G GAAAAAAACAAAAT GAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACCGTGCT AAGTCAAAAAAAAT CGAAGCAACAAGGT GTT C CGCAGGTAT CT CTGGTAGAAATAGAGT TTATAAGT GT 
CAAG GAGGCAG CT GT CTTAGTAGAGCAT GCAGAATTGATT CTACAACAAAACT GTAT GGAAAAGATT GTCAATTCTT T 
CCTGATAAAGTACAAACAGAAAAAGCATCC ATAAT GTT TAT GCAAAGTATT GATT CT GTT GTT GAATT TT GTAACGAA 
AAAACCCATAAT CAAGAAGCTCCAAGCCTACAAAACATAAAGT GCAATTTTAGAAGT ACAT GG GAGGT GATTAGCAAT 
TCT GAGGATTTTAAAAACACCATACC CATGGT GACACCACCT CCT CCACCT GT CT T CT CATT GCT GAAGAT CAGT CAA 
AGAATTGTGT GCTTAGTT CT T GATAAGT CT GGAAGCATGGGGGGTAAGGACCGCCTAAAT CGAATGAATCAAGCAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AATAAG C TAAT CCAAATAAAAAG CAGT GAT GAAAGAAAC ACACTCAT GGCAGGAT T ACCT ACATAT C CTCT GGGAGGA 
ACT T CCAT CT GCT CT GGAATTAAATAT GCATT T CAGGT GAT T G GAGAGCT ACATT C CC AACT CGAT GGATCCGAAGT A 
CT GCTG CT GACT GAT GGGGAGGATAACACT G CAAGT TCTT GT ATT GAT GAAGT GAAACAAAGTGGGGCCAT TGTT CAT 
TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 
TCAGAT GAAGCT CAGAACAAT GGCCTCATT GAT GCT TTTGGGGCT CT TACAT CAGGAAATACT GAT CTCT CCCAGAAG 
T C CCT T CAGCT C GAAAGTAAGGGAT TAACACT GAATAGTAAT GCCTGGAT GAACGACACT GTCATAATTGATAGTACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
ATAAT GGAAAATT TCACAGT GGAT GCAACTT CCAAAAT GGC CTAT CTCAGTAT TCCAGGAACT GCAAAGGT GGGCACT 
T GGGCATACAAT CTT CAAGCCAAAGC GAAC CCAGAAACAT TAACT ATT ACAGTAACT T CT CGAGCAGCAAATT CTT CT 
GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATTGTTTACGCAGAA 
ATT CTACAAGGATAT GTACCT GT TCTT GGAGCCAAT GT GACT GCTT T CAT T GAAT CACAGAAT GGACATACAGAAGT T 
TTGGAACTTTTGGATAATGGTGCAGGCGCT GATTCTTT CAAGAAT GAT GGAGT CTACT C CAGGTATTT TACAGCATAT 
ACAGAAAATGGCAGATATAGCT TAAAAGTT CGG GCT CATGGAGGAGCAAACACTGC CAGGCTAAAATTACGGC CT CCA 
CT GAATAGAGCC GCGTACATACC AGGCT GGGTAGT GAAC GG GGAAATT GAAGCAAACC C GCCAAGAC CT GAAATT GAT 
GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 
CTT CCCTT GCCT GAC CAATAC CCAC CAAGT CAAAT CACAGACCTT GATGCCACAGT T CAT GAGGATAAGATTATT CTT 
ACATGGACAGCACCAGGAGATAATTTT GAT GTT GGAAAAGTT CAACGTTATATCATAAGAATAAGT GCAAGTATT CTT 
GAT CTAAGAGACAGT TTT GAT GATGCT CTT CAAGTAAATACTACT GATCT GTCACCAAAGGAGGC CAACTC CAAGGAA 
AGCTT T GCATTTAAACCAGAAAATAT CT CAGAAGAAAATGCAACC CACATATT TAT T G C CATTAAAAGTATAGAT AAA 
AGCAATTT GACAT CAAAAGTATC CAACATT GCACAAGTAACT TTGTTTAT CC CT CAAGCAAAT CCT GATGACATT GAT 
CCTACAC CTACT CCT ACT CCTACT CCTACT CCT GATAAAAGT CATAATTCT GGAGT TAATATT T CTACG CT GGTATT G 
T CT GT GAT TGGGT CT GTT GTAATTGTTAACTTTAT TT TAAGTACCACCATT TGAACCTTAACGAAGAAAAAAAT CTT C 
AAGTAGAC CTAGAAGAGAGT T TT AAAAAACAAAACAATGTAAGTAAAGGATATTT CT GAAT CT TAAAATT CAT CC CAT 
GT GTGAT CATAAACT CATAAAAATAATTTTAAGAT GT CGGAAAAGGATACTT T GATTAAAT AAAAACACT CAT GGATA 
T GTAAAAACT GTCAAGAT TAAAAT TTAATAGTT TCAT TTATTT GTTAT TTTATTT GTAAGAAATAGTGAT GAACAAAG 
ATCCT TT TTCAT ACT GAT ACCT G GTT GTAT ATT ATTT GATGCAACAGT TTT CT GAAAT GATATT T CAAATT GCAT CAA 
GAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGGAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID=l 0063565 



Page 221 of 320 



FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGS CL S RACR I DS TTKLYGKDCQ FFP DKVQTEKAS IM FMQS I DS WE FCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMraFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDG5EVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANS S VP P I T WAKMNKDVNS FP S PM I VYAE I LQG YVP VL GANVTAF I E S QNGH TEVLELLDNGA 
GADS FKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYI PGWWNGE IEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI I RI SAS I LDLRDS FDDALQVNT TDL S PKEANSKE S FAFKPENI S EENATHI FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide : 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT C CT T AGGT GGAAACCCT GGGAGTAGAGTACT GACAGCAAAGAC CGGGAAAGACCATAC GT C CC CGGGCAGGGGT GA 
C7\ACAGGTGTCATCTTTTTGATCTGGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTG CTCAGAAGT GACTT GC CT GAGGGT G GACCAGAAGAAAGGAAAGGT CCCCT CTT GCT GTT GGCT GCACAT CAGGAA 
GGCT GT GAT GGGAAT GAAGGT GAAAACTT GGAGATTT CACTT C AGT CATT GCTTCTGC CTGCAAGAT CAT CCTTTAAA 
AGTAGAGAAGCT GCT CT GTGTGGT GGTTAACT CCAAGAGGC AGAACT CGTT CTAGAAGGAAAT GGATGCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTG AATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTT TT GCT GGT GCT C CT CT GCT GT GCTAT CT CT GTCCT GTACATGT T G GCCT GCAC CCCAAAAGGT GACGAGGAGCAG 
CT GG CACT GCCCAGGGC CAACAGCCCCACGGGGAAG GAGGGGTACCAG GCCGT CCTT CAGGAGTGGGAGGAGCAGCAC 
C GCAACTACGTGAGCAGCCT GAAGCGGCAGAT C GCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCTCAG G 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CT GGCCT T CCT GCACTC GCAG GT GGACAAGGCAGAGGT GAAT GCT GGCGT CAAGCT GGCCACAGAGTATGCAGCAGT G 
CCT T T CGATAGCTTTACT CTACAGAAGGTGTACCAGCT GGAGACT GGCCTTACCCGCCAC CC CGAG GAGAAGCCT GT G 
AGGAAGGACAAG C GGGAT GAGT T GGT GGAAGC CATT GAAT CAGCCTT GGAGACCCT GAACAAT CCT GCAGAGAACAGC 
CCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTAT 
GAGCTCAC CTT CAAAGGGGACCACAAACAC GAATT CAAACG GCTCAT CTTATTTCGACCATT CAGCCCCAT CATGAAA 
GTGAAAAATGAAAAGCTCAACATGGCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTC 
CGGCAGTT CAT GCAGAAT TT CAG GGAGATGT GCAT T GAGCAGGAT GGGAGAGT CCATCTCACT GTT GTTTACTTT G GG 
AAAGAAGAAATAAAT GAAGT CAAAGGAATACTT GAAAACACT TCCAAAGCT G CCAACTT CAGGAACTTTACCTTCAT C 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCT GTGAT GT GGACAT CTACTT CACAT CT GAAT T CCT CAATACGT GTAGGCT GAATACACAGCCAGGGAAGAAG 
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
CAGCAG CT GGT CATAAAGAAG GAAACTGGAT TT T GGAGAGACTTT GGATT T GGGATGACGT GT CAGTATCGGT CAGAC 
T T CAT CAATATAGGT GGGTT T GAT CTGGACATCAAAGGCT GGGGCGGAGAGGAT GT GCACCTT TAT CGCAAGT AT CTC 
CACAGCAACCT CATAGT GGTACGGAC GC CT GT GCGAGGACT CTTC CACCT CTGG CAT GAGAAGCGCTGCAT GGACGAG 
CT GACC CCCGAGCAGT ACAAGAT GT GCATGCAGTC CAAGGC CATGAACGAGGCAT CCCAC G GCCAGCT GGGCAT GCTG 
GT GTT CAG GC^CGAGATAGAGGCTCACCTTC GCAAACAGAAACAGAAGACAAGTAGC^ 

GAAGGATT GTGGGAGACACTT TTTCTTT CCTTTT GCAATTACT GAAAGT GGCT GCAACAGAGAAAA.GACT T CCATAAA 
GGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAG?iAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATAT T CATGATTTAAGAGCAGTTTT GTAAAAAATT CATTAGCAT GAAAGGCAAGCATATT TCT CCTCATATGAAT GA 
G C CTATCAGCAGGGCT CT AGTTT CTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGATAGGCT TATTATGATACT 
AGT GAGTACAT TAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCAT AAATAT CGT GT CATATTTT C CCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
T TTTTT CC CT T GT GAGTTAT AGT CT GCTTAT TTAATTAC CACTT T GCAAGCCTTACAAGAGAGCACAAGT T GG C CTAC 
ATT TTTATAT TTTT T AAGAAGATACTTTGAGAT GCATTAT GAGAACTT T CAGTT CAAAGCATCAAATT GAT GC CAT AT 
CCAAGGACAT GCCAAATGCT GATT CT GT CAGGCACT GAAT GTCAGGCATT GAGACATAGGGAAGGAAT G GTTT GTACT 
AAT ACAGACGTACAGATACT TTCT CT GAAGAGTATT TT C GAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
ACTTT CTGCTTTACAGAAAAGGAAACTCATT CAGACTGGT GATAT CGT GAT GTAC CTAAAAGT CAGAAACCACATTTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGT G CT CCT CCT GGCTT CT GGCTT CCATAAGAAGAAAT GGAGAAAAATATATATATATATATATATATT GT 
GAAAGATCAAT CCAT CTGC CAGAAT CTAGTGGGAT GGAAGTTT TT GCTACATGTTAT C CACCC CAGGCCAGGT GGAAG 
T AACT GAAT T ATTTTT TAAATTAAGCAGT T CTACT CAAT CACCAAGAT GCT TCT GAAAATT GCATT TTATTACCAT TT 
CAAACTATTTT TT AAAAATAAATACAGTT AACATAGAGT GGTTT CT TCAT TCAT GT GAAAAT TATTAGCCAGCAC CAG 
AT GCAT GAGCTAATTATCT CTTT GAGTCCTT GCT T CTGTTT GCT CACAGTAAACTCAT T GTTT AAAAGCTT CAAGAAC 
ATTC^AGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAGGTGGTATTGCACAGCTAATAAAATAT GATTTGT GGATAT GAA 
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FIGURE 72 

MMKTORGLIAWISRVWLLVLLCCAISVLYMIACTP 

EQHRNYVS SLKRQ I AQLKEELQERSEQLRNGQYQAS DAAGLGLDRS PPEKTQADLLAFLHS QVDK 

AEWAGVK1ATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 

ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 

TLIOTIVPI^RVDKFRQFMQNFREMCIEQDGRVHLTVVTFGKEEINEVKGILENTSKAANFRNF 

TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 

NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNLIVVRTPTOGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEM 

RKQKQKT S SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACT ATGT TGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACAC T GGACG TATGAGGGCCCACAT GGT CAGGACCAT T GGCCAGCC T CT TACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATAT TCAGACAGACAGTGTGACATTT GACCCTGATT TGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCTTATGAAGACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT T T CAAT GGAACAGC TGGAAAAGC T TCAGGGGACAT TGT T CT C CACAGAAGAGGAGCCCT C TAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGT GGTC TT CACCTCAGCACAAGCC ACGAC TGAGGC ATAAA T T CC T TCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGAT GTCCTCCTTCCCCTGGACATCTCT TAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAAGTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C T CC T TAG GAT AAAGAGT T GC T G T T GAAG T T G TAT AT T T T T GAT CAAT AT AT T T GGAAAT TAAAG 
TTTCTGACTTT 
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FIGURE 7 A 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRWFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 7S 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCT GGTGGT T T GGC TAAACCTGCAAACAT CACCT TC T TAT CCAT CAACAT GAAGAATGTCCTACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGT TAAAGT TACT TACACTGTGCAGTATTTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCT TCCTGTT TCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAA?VGAGAAACACCCAGCAAATTT 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
ACT T TAT CACCCTCAATAT CTCGGAT GAT TCTAAAAT T TC TCATCAGGATATGAGTT TACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACAT T TAGGGTATGCT TCGCAT T T GATGGAAAT T T T T T GT GAC T CT GAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAGT CAT T GAATATGAATAT GATGTCAGAACCACTGACATTT GTGCGGGGC CTGAAGAGCAGGA 
GCT CAGT T TGCAGGAGGAGG T GT CCACACAAGGAACAT TAT TGGAGT CGCAGGCAGCGT TGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
C AG GAGC AC AC AGAC T CGGAG GAG G G G C C GGAGGAAGAGC CAT C GAC GAC C C T GG TC GAC T GGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATC TCATGCAATT CAT GGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGT CAATAAAT GT TT GCCAGAC T GGGT GCAGAAT T TAT TCAGGT GGGT GT 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISVVLTAPEKWKRNPEDLEVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
I VINFI TLNI SDDSKI SHQDMSLLGKSSDVS SLNDPQPS GNLRPPQEEEEVKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEM 

Important features; 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGAT CAAT GAAAAGGC TGCAGATAAGC TGGGATC TACCCAGAT CGT GAAGAT CC TAAC T CAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACT CTGGGAT T GGC T GGT T CCAACCT GATGTT CTGAAAAACAT CAT CACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAG TGAA GACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVPI SLS IDRLEFDLLYPAIKGDT I QLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S S DRI QLMNS G I GW FQP DVLKN 1 1 TE I IHS I LLPNQNGKLRS GVPVS LVKALG 
FE AAE SSL TKDAL VL T PAS LWKP S S P VS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGC TGC TGAGGAT AGACTGACT TT GGCCAT T GGAT TGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GAC TAAT T TGTGCATGAAC TGAAATAAAACCATCC TACGGT AT CCAGGGAACAGAAAGCAGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCCT GGGACAT T TAAAAAAATA 



FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS TLLGLPAD I QAAQAMMVTS S AI S S LAC 1 1 S WGMRCTVFCQE SRAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
j~ 5 j. ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
Q GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 

fit 

AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

Is] 

i «f ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

y 1 

ACT GCCAGACTCC T GCTGCCAAGCC T GCAAAGATGAGGCAAGT GAGCAAT C GGATGAAGAGGACA 
ill GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
■ B AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
: 3 p CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
f|| CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CI CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
U * ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 

CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACT TGGT GGAGATC TAC CTCT GGAAGC T GGTAAAAGATGAGGAAAC T GAGGC TCAGAGAG 
G TGAAGTACCT GGCCCAAGGCCACACAGCCAGAATC T TCCACT T GAC T CAGAT CAAGAAAGTCAG 
GAAGCAAGACT T CCAGAAAGAGGCACAGCACT TCCGACT GC TCGCT GGCCC CCACGAAGGT CAC T 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACAT AACAAAGACCTAACAGTT GCAGATAT GAGC TGTATAATT GT TGT T AT TAT AT AT TAATAAA 
TAAGAAGT T GCAT TACCC TCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEWVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 

GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCC ATGG CTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGT T CT CT AG CGTGGT CC ACCT CT AC AGGGACGGGAAGGACC AG CC AT TT AT GC AGAT GC CACAGT AT C 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AGGT GT CAGC ACT GGGCT CAGT TC CTCT CATTT C C AT C ACGGGAT AT GT T GAT AGAGACATC C AGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AG AC T C C AGGAC AAAC AG AG ACAT GC AT GGCCTGTTTGAT GT GG AGAT CT CT CT GACCGT CC AAGAGAA 
CGCCGGGAGC AT AT CCTGTT CC AT GC GGC ATGCTC ATCT GAGCC GAGAGGTGGAAT CCAGGGT ACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
T T TT T T GGC AT T GT TGGACT GAAG AT TTTCTT CTCC AAATTCCAGTGGAAAATC C AGGCGGAAC TGGACT G 
GAGAAG AAAGC ACGGACAGGCAGAAT TGAGAGACGC CCGG AAAC ACGC AGT GGAGGTGAC TCTGGAT C CAG 
AG ACGGCT CACCCGAAGCT CTGCGTT T CT GAT CTGAAAACTGT AACCC AT AGAAAAGCT CCCCAGG AGGT G 
CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 
CT GGGAGGT G GACGGAGG AC AC AAT AAAAG GT GGCGCGT GGG AGTGTGCCGGGATGATGT GG AC AGGAGGA 
AGGAGT ACGT GACT TT GTCT CC CGAT C AT GGGT ACT GGGT CCTC AG ACTGAATGGAGAAC AT T TGT AT TT C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT TATT GAGGCC CT AC AT TG AGTATC CGTC CT AT AATGAGCAAAAT GGAACT CCC ATAGTCATCTGC 
CCAGTC ACCCAGGAAT C AGAGAAAGAGGCCTCTT GGC AAAGGGC CTCT GC AAT C CCAGAGAC AAGC AACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATG TAGG ATGAATCACATCCCACAT 
TC TT CTT T AG GGAT AT T AAGGT CT CT CT CCC AGAT C C AAAGT CC CGCAGC AGCC GGCCAAGGT GGCTT CC A 
GATGAAGGGGGACTGGCCT GT C CACATGGG AGTC AGGT GTCATG GCTGCC CTGAGCTGGGAGGGAAGAAGG 
CTGACATT AC AT TT AGTTTGCT CT CACT CC AT CT GG CT AAGT GATCTT GAAAT ACC ACCT CT CAGGTGAAG 
AACCGT C AGGAATTCCCATCTC AC AGGCTGTGGTGT AGAT TAAGT AGACAAGGAATGT GAAT AAT GCT T AG 
ATCT TATT GATG AC AGAGTGTATCCT AATGGT TT GT TC AT TAT AT T AC ACTT TC AGT AAAAAAA 



AppJD=l 0063565 



Page 235 



FIGURE 84 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGS ISC SMRHAHL S RE VE S RVQ I GDT FFEPI SWHLATKVLGI LCCGLFFGI VGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS PDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCT CTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACT GCT GACGATGCAGAG T TCCGT GA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATT GCAC C CT GAGCATCAGAGATGCCAGAAGAAGT GAT GCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCGTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCC T CT CAGGGGCCCC T GAC T GAACCT T GGGCAGAAGACAGT CCCCCAGACCAGCC T CCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAAC T GCAGAGAC T CAC C C T GAT T GAGGGAT CACAGCCC C T C CAGG C AAG GGAGAAG T CA 
GAGGCT GAT TC T TGTAGAAT TAACAGCCCT CAACGTGAT GAGC TAT GATAACAC TAT GAAT TAT G 
T GCAGAGT GAAAAGCACACAGGCT T T AGAGTCAAAGTAT C TCAAACCT GAAT CCACACT GTGCCC 
TCCCT TTTATT TTTT TAACTAAAAGACAGACAAATTCCTA 



AppJD=10063565 



Page 237 



FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFRSG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTL I PQPQDHGTS LTCQVT FPGAS VT TNKTVHLNVS YP PQNLTMTVFQGDGT VS TVLGNGS SL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 



ill 
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FIGURE 87 



AGAAA.GCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
GCAAGGAAAGTGCAGCT GAGACTCAGACAAGAT TACAATGAAC CAAC TCAGCT T CCTGC TGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGGATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTC CCTGC TGAGGTACCGCACGGACACTGGC T TCC T CCAGACACT GGGACATAATC T 
GTT T GGCATC TACCAGAAATAT CCAGTGAAATATGGAGAAGGAAAGT GT TGGAC TGACAAC GGCC 
CGGT GAT CCC TG TGGTC TATGAT T T T GGC GAC GCCCAGAAAACAGCATCTT AT TACTCAC CC TAT 
GGCCAGCGGGAATTCACTGCGGGAT T TGT TCAG T TCAGGGTAT T T AATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACT T T CCAGAGGCCAGTCCCCAGCAGT GT GGAGAT TTTTCTGGTTTT GAT TGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAACTTACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA 
AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDSCPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNT FGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCI GGGG YFPEAS PQQCGDFS G FDW S GYGTHVGYS SSREI TEAAVLL FYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites . 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGGTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T T AAG AC T T ATATACAGT T T TAG GGGAC AAT T AAAAAAAAAAAA 
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FIGURE qo 



Q 

a 
m 
w 

Lf! 
SI 
if! 

0 
ill 
ci 
ill 
0 
ru 



MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VSIAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMMiFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCT CAGCCAAGGTGAAGAAAAT CT T TGGC TGGGGC GACT TCTAC TCCAACATCAAGACGGT G 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
Wfc CCAACACAAT GCCACAGGCCAGGGAAACATC TCCAT CAGCCTC GTGCCCCCCAG TAAAGC TGTAG 

□ AGTTCCACCAGGAACAGCAGATCTTCATCG7\AGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
H GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
Hi CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
I ff TCTACAT CGCC T TC TACAGCACGGACTAT CGGCT GG TCCAGAAGGT G TGCC CAGAT TACAACTAC 

fll CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
^' TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
Tn AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
p AGGAGGAG TGGGCTCTCTGTG CAGC C T CACAGGGC T T T GC C AC G GAG C C AC AGAGAGAT GC T GGG 

ftj T CC CCGAGGCC T GT GGGCAGGCCGAT CAGTGTGGCC CCAGAT CAAGTCAT GGGAGGAAGC TAAGC 

W CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 

m 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCAT C C TTAAGC TAAGACAGGAC GAT T GTGGTCC TCCCACAC TAAGGCCACAGCCCAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNIS ISLVPPSKAVEFHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKI CSRDHAQS SAT WSCS QPFKWCVY I AFYS TDYRLVQKVCPDYNYHSDTPYYPS G 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCT.T 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE Q4 

MTAAVFFGCAFIAFGPAIJUjWFTIAIEPLRIIFLIAGAFFWLVSLLISSLWFMARVIIDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQT FI S S Y YG INLASAF 1 1 LVLMGT WAFLAAGGS CRS LKLCLLCQDKNFLL YNQRS R 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosarninoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q5 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAAT CTGAAACATGAGGAGTACGAT T C TACT GT T T TGTCT TCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAAT CAGGTC TTTCCTTCTT TAAGT CTGATAC CATTAACA 
CAGATGCTCACAGTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAAGAGCAACTGCACCCACATGTGTTAC 
CAAT TT T T GT CACACAAC TT GGAGCCCAGGGCAC TATCC TAAGCTCAGAGGAAT TGCCACAAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TAT AT TAT G GAAT AGAT T GAGAC AC AT T GGAT AG T C T T AGAAGAAAT TAAT T C T TAAT T 
T AC C T G AAAAT AT T C T T GAAAT T T C AG AAAAT AT G T T C TAT G T AG AG AAT C C C AAC T T T T AAAAA 
CAATAAT TCAATGGATAAATC T GT C T T T GAAAT ATAACATTAT GCTGCCT GGATGATATGCATAT 
TAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLtAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLII HS L FPGGI LPT S QAGANPDVQDGS L PAGGAGVNPAT QGT PAGRLPT P S G 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGC TACAGGT CACCCTGGAGAT GCAGGAT GGACAT G T CTT GTGGGG TC CAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
G GATGAGCCAGGCACAGC CAAC TCGGAT C T TC GAT T CCACATCCTGAGCCAGGC TCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
f* k ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
IJ CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
Q CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
ffl CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
7 { TGCAGAGGGAAACCT CTACGT GACCAGAGAGC T GGACAGAGAAGCCCAGGC TGAGTACC T GCTCC 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
111 ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
yl CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
f|| ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
1.. CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
0 TCGAAGT C GCAGTCACAGATATCAAT GAT CACGCCCC TGAGT TCAT CACT T CCCAGAT TGGGCC T 

tfl ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
H C CT CGAGCCCGCCT T CCGCCT CAT GGAT T T TGCCAT TGAGAGGGGAGACACAGAAGGGACT T TT G 

jJl GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
iff GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
Cj AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
f|| ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCC TGGGAGAGAGCCCAGCACCCAAGAT CCAGCAGGGGACAGGACAGAGTAGAAGCCCC TCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCT GGGCCCTATGGGAAAAAA?U^AAAAAAAAAAAAAAAAAAAAAAAG 



AppJD= 



10063565 



Page 249 of 320 



FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVXWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH 1 1 PWVSHNAQMWQLLVRVI VCRCNVEGQCMRKVGRMKGMP TKLS AVG I LVGTLVAI GIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGAT TCT T CCCAGGAACACAAACGTAGGAGACCCACGC TCCTGGAAGCACCAGCC T T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCT AGGACCCAGGCATC T TGCT TT CCAGC CACAAAGAGACAGATGAAGAT GCAGAAAGGAAAT G 
T T C T C CT TAT GTTTGGTC T AC TAT T G CAT T T AGAAGC T G C AACAAAT T CCAAT GAGAC T AGCAC C 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
G CAT AG T CAC C AAC TCT GAG T T CC AT AC AAC C T C C AG T G GGAT C AG C AC AG C C AC C AAC TCT GAG 
T TCAGCACAGCGTC CAGTGGGAT CAGCATAGCCACCAAC TCTGAGT C CAGCACAACCT CCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
u CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
III AGT AGGGCCAGCAC TGCCACCAACT CTGAGTC TAGCACAC TCT CCAGT GGGGCCAGCACAGC CAC 

P CAACT CT GACT CCAGCACAACCT CCAGT GGGGC TAGCACAGCCACCAAC T CTGAGTCCAGCACAA 

Q CCT CCAGT GGGGCCAGCACAGC CAC CAACT CTGAGT CCAGCACAGT GTCCAGTAGGGCCAGCAC T 

r|] GCCACCAAC T CT GAGTCCAGCACAACCT CCAGTGGGGCCAGCACAGCCACCAAC T CTGAGTCCAG 

I'll AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
f!j GCACAGC CAC CAAC TCTGACTCCAGCACAGT GT CCAGTGGGGCCAGCACT GCCACCAAC TC TGAG 

TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
CP GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
iff CTGAGT C CAGCACAGTGT CCAGT GGGATCAGCACAGT CAC CAAT TCT GAG T CCAGCACACCC TCC 

1' AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
% CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
^ CCTCCAGT GGGGTCAGCACAGCCAC CAACT CTGAGTCCAGCACAACCT CCAG TGGGGC TAGCACA 

ill GCCAC CAACTC TGAC TCCAGCACAACC TCCAGT GAGGCCAGCACAGCCAC CAAC TCT GAGT C TAG 

Q CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
f{\ ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
% ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
W-[ GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
flj TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 

CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C T TGAAGAAGGTATTCC TCACCT T TCT TGCC TT TACCAGACACT GGAAAGAGAATACTATAT 
TGCTCATT TAGCTAAGAAATAAATACATCTCATCTAACACACACGACAAAGAGAAGC TGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCT GGCATTCAAAATC TCCACAGTAAAA.TCCAAAGACCTCAAAAAAAAAAAAAA?\AA?\AAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID=10063565 



Page 251 of 320 



FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLYPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCT GGAACTATCAGGCT GAAAAACAGAGTGGGTAC T CTC T TCT GGGAAGC TGGCAACAAAT 
GGAT G AT G T GATATATGCAT T CCAG G GGAAG GGAAAT T G T GGT G CT T CT GAAC C C AT GGT CAAT T 
AACGAGGCAGT TTC TAGC TAC T GCAC GTACT TCATAAAGCAGGACT CT AAAAGC T TT GGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
k* CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
2 GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 

1« GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 

•LI \ 

yj TGCCTCAAAGCGAGTGTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 

H] C T T CAT T CAT AG GAAAT GGAAG GAT GACAAGAGCC AT TTC GAAGAC AT GAT T GAT TAC T T T T GT G 

W 1 ATAT TCACGAACCAC T T CAACT CCT CAT AT TCCCAGAAGGGAC TGAT C TCACAGAAAACAGCAAG 

111 

TCT CGAAGTAATGCATT TGCTGAAAAAAATGGACTTCAGAAATATGAATATGT TT TACATCCAAG 

: n AACTACAGGCT TTACTTT TGTGGTAGACCGTCTAAGAGAAGGTAAGAACCT TGATGC TGTCCATG 

\n ATATCACTGTGGCGTATCC TCACAACAT T C CT CAATCAGAGAAGCACC TCCTCCAAGGAGAC TT T 

Q CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 

riJ CCT T CAAC TCTGGT GCCACAAAC GGT GGGAAGAGAAAGAAGAGAGGC T GCG T TCCT TC TAT CAAG 

h; gggagaagaatttttattttaccggacagagtgtcattccaccttgcaagtctgaactcagggtc 

ill 

cttgtggtcaaattgctctctatactgtattggaccctgttcagccctgcaatgtgcctactcat 
atatttgtacagtcttgttaagtggtattttataatcaccattgtaatctttgtgctgcaagaga 
gaatatttggtggactggagatcatagaacttgcatgttaccgacttttacacaaacagccacat 
ttaaattcaaagaaaaatgagtaagattataaggtttgccatgtgaaaacctagagcatattttg 
gaaatgttctaaacctttctaagctcagatgcatttttgcatgactatgtcgaatatttcttact 
gccatcattatttgttaaagatattttgcacttaattttgtgggaaaaatattgctacaattttt 
tttaatctctgaatgtaatttcgatactgtgtacatagcagggagtgatcggggtgaaataactt 
gggccagaatattattaaacaatcatcaggcttttaaa 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFG IMVSWKGIYFILTLFWGSFFGSIFMLSP 

FLPMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVIITGDAFVPGERSVIIMNHRTRM^ 

FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 

LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 

YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 

YFTGQSVI PPCKSELRVLWKLLS I LYWTLFS PAMCLL I YLYS LVKWY FI I T I VI FVLQERI FGG 

LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAAT CAT CCATCCACC CC TGC T G TCAT CTGT TT TCATAGTGTGAGAT CAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATT T G TC T T CAGAC TCCAGAGCAAAT GCAGATGGGTACAGC CT GTAT GAT GT GGAGAT CTCC 
AT TATAG T CCAGGAAAATGCT GGGAGCATAT TGTGT T CCAT CCACC T T GC TGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTG 
AAAT CCAAAGGGAAAATCCAGGCGGAAC T GGACTGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
T GACATGT CAGT TT GAAGGC TT GT TGAGACCCTATATCCAGCATGCGATGT AT GACGAGGAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGT GCTCC CGACAGGT GGCCCCAGC T TC C TCT 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
T CAGAAGCCATGGCT GCCC TGAAGT GGGGACGGAATAGAC T CACAT TAGGT T TAGT T TGTGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT T CC TGCC TCACAGGT GAAGAT TAAAGAGACAAC GAAT GT GAATCATGC TTGCAGGT T 
TGAGGGCACAGTGT T TGCTAAT GAT GTGT T T T TAT AT TATACAT TT T CCCACCATAAACTC TGT T 
TGCTTAT TCCACATTAATT TACTTT TCTCTATACCAAATCACCCATGGAATAGT TATTGAACACC 
TGCT T TGT GAGGC TCAAAGAATAAAGAGGAGGTAGGAT T T T TCACT GATT C T ATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
CTCAT TAACACAGACACAAAAAT TCTAAATAAAATT TTAACAAATTAAACTAAACAATATAT TT A 
AAGAT GATATATAAC TAC TCAGT GT GGT T T GT CCCACAAATGCAGAGT TGGT TTAATAT TTAAAT 
ATCAACCAGT GTAAT TCAGCACAT TAAT AAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDS IAGGRVSLRLKNI TPSDI GLYGCWFS SQI YDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LC S IHLAEQSHE VE SKVL I GET FFQP S PWRLAS I LLGLLCGALCGWMGM 1 1 VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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CC TT CACAGG ACTCTTCATT GCTGGT T GGC AATGA T GT ATCGGC C AGATGTGGT GAGGGCT AGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTC ACT GT T CAT TATGTG AGATAT AATC AAAAGAAGAC CT ACAATT ACT ATAGC AC AT TGT C AT TT AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAATGGTGAAAAAT GCAT TTT ATAAATCT C CAT T AAGGGAAGAATT T GTC AAGTCT CAGGTT AT CAAGTT C 
AGTC AACAGAAGC AT GGAGT GTTG GCTCAT AT GCT GTT GATT TGT AGAT T TCAC TCT ACT GAGG AT CC TG A 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
AT CCTC ACT C AGTT AAAATT AAAAAAATCAAC AAGAC AGAAAC AGACAGCT AT C TAAAC C AT TGCTGC GGA 
ACACGAAGAAGTAAAACTCT AGGTCAGAGT CTCAGGAT CGTTGGTGGGAC AGAAGT AGAAGAGGGTGAAT G 
'« GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
T T GT GAGT GCTGCT CACT GT TT T ACAAC AT AT AAGAACCCTGCC AG AT GGACT GCTT CCT T T GGAGT AAC A 



AT AAAACCT T CG AAAATGAAACGGGGTCT C CGGAGAAT AATT GT CC AT GAAAAATACAAACACCCATC AC A 
TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
GATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACTTGCAATGAACC 
T C AAGC T T ACAATG AC GCCAT AACTCCT AG AATGTT AT GTGC TGGCTCCT TAGAAGGAAAAAC AGATGCAT 
GCC AGGGT GACT CT GG AGGACC AC TGGTT AGT T CAGAT GCT AGAGAT ATC TGGT ACCTT GCT GGAAT AGT G 
UJ AGCT GGGG AGAT GAAT GT GC GAAAC C CAAC AAGCCT GGTGT T T ATACT AG AGTT ACGGCC TT GCGGGACT G 

M GATT AC TT CAAAAACT GG TATCTAAGAGAC AAAAGC CTC ATGGAACAGAT AAC ATT TTTTTTTGTTTTTTG 

GGTGTGGAGGCC AT TTT T AG AGAT AC AGAATT GG AG AAGACT TGC AAAAC AGCT AGATTT GACTGAT C TC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACTCT GT CATCTGT GAGC AAT AGT T GAAACT TT ATGT AC AT AGAGAAAT AG AT AAT ACAAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GC AT AT AT AC AATTTGAAGC ACTCCT T TTCTT C AGT TC CTCAGC TCCT CT CAT T TC AGCAAAT ATCCATT T 
T C AAGG TGCAGAACAAGGAGTG AAAG AAAATAT AAG AAGAAAAAAATCCC CT AC AT T TT ATT GGCACAGAA 
AAGT ATTAGGTGT T TT TCTT AGTGGAAT ATTAGAAAT GAT CAT ATT CATT ATGAAAGGTCAAGC AAAGAC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CCTT AT T TTC AT TT CC AAAC AACT ACT ATGAT AAAT GT GAAGAAGATT CT GTTT TT TTGT GACCT 
AT AAT AAT TAT ACAAACT TC AT GCAATGT ACT T GTT CT AAGCAAAT TAAAGCAAAT ATTT AT TT AACAT TG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 



ru 
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FIGURE 106 

MMYRPDWRARKRVCWE PWVI GLVI FISL IVLAVC I GLT VHYVRYNQKKT YNY YS TLSFT TDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTAS FGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDE CAKPNKPGVYTRVTALRDW ITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGT TCTCACT TTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATT T TGACTTACTT TCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
G T G T AGAAC TGGCTTGTGGCGTTTG GACATAT GAACAGGAAC T TAT G G T T C C AG T ACAAT GG TCA 
GATATGGTCAC TTTGAAAGCCAGGATGACAAAT TATGGAT TACCTAGATATCGGTGGCTTACTCA 
T GCT T GGAAT T T T T T TCAGAGAGAGT T TAAGTGC TGT GGAGTAG TATATT TCAC TGAC TGGT TGG 
AAAT GACAGAGATGGACTGGC CC CCAGAT TCC T GCTGTGT TAGAGAAT TC C CAGGAT GTT CCAAA 
CAGGCCCACCAGGAAGATC TCAGTGACC T T TATCAAGAGGG T TG TGGGAAGAAAATGTAT TCCT T 
T TTGAGAGGAACCAAACAACTGCAGGT GCT GAGGT T T CT GGGAATC T CCAT TGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAAT GATGTCC TTGAAGAATGACAACTCT CAGCACC T GT CATGT CCCTCAGTAGAAC T 
GTTGAAACCAAGCC TGTCAAGAATCT TTGAACACACATCCATGGCAAACAGCT TTAATACACACT 
T T GAGAT GGAGGAGT T ATAAAAAGAAAT GT CACAGAAGAAAACCACAAACT TGTT TTAT TGGAC T 
TGTGAAT TTTTGAGTACATACTATGT GTT TCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA 
TAACACCTAAGCATATACTATTCTAT GCTT TAAAATGAGGATGGAAAAGTT TCATGTCATAAGTC 
ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
T GT GT AT GACTT TTAC TGAACACAG T TATGTT T TGAGGCAGCAT GGT T T GAT TAGCAT T T CCGCA 
T CCAT GCAAACGAGTCACATAT GGTGGGACTGGAGC CATAGTAAAGGT TGAT TTACT TC TACCAA 
C TAGTATATAAAGTACTAATTAAATGCT AACATAGGAAGT TAGAAAATAC TAATAAC T T TTAT T A 
CTCAGCGATC TAT TC T TCTGATGCTAAAT AAAT TATATATCAGAAAAC TT T CAATAT TGGTGACT 
ACCT AAAT GT GAT T T TTGC TGGT TACTAAAATAT TCT TACCACT T AAAAGAGCAAGCTAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T TCAGT TCTGATAATGT TAAGAATAACCAT T ATGAAAAGGAAAAT TTGTCC T GTATAGCATCAT T 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
T GTTAT T TAAATAC T TAACCACT AAT T T T GAAAAT TACCAGTGT GATACATAGGAATCATTATT C 
AGAATGTAGTC TGGT C T T TAGGAAGTAT TAATAAGAAAAT T TGCACAT AACT TAGT TGATTCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TATC T T TGCC T TC TCCAAACAAGAAGCAATAGTC TCCAAGT CAATAT AAAT T C TACAGAAAA 
TAGT G TT CT TT T TCTCCAGAAAAAT GC T T GT GAGAATCAT TAAAACAT GT GACAAT T TAGAGAT T 
CTTTGTTT TAT T TCAC T GAT T AAT AT AC TGT GGC AAAT T ACACAGAT TAT T AAAT T T T T T T AC AA 
GAGT ATAGT ATATT TATT T GAAATGGGAAAAG TGCAT T T TACTGTAT T TT GTGTAT T T TGT T TAT 
T T C T CAGAATATGGAAAGAAAAT TAAAAT GTG TCAATAAATAT T T T CT AGAGAG TAA 
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FIGURE 108 

MAREDSVKCLRCLLYALNLLFWLM 

VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKAI^TNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLS DLYQEGCGKKMYS FLRGTKQLQVLRFLGI S I GVTQ ILAMI LT I T LLWALYYDRREPGTDQM 
MSLPCNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTGCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGG ATG CCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACT GACCACCAGTT GGATGAGTCT CAAGGTCCTCCTCT AT GT GAC AACCAT GT GAAT GGGGAG 
TGGTACC ACTT CAC GGGCATGGCGGGAGATGCCAT GCCT ACCTTCT GCATACCAGAAAACCACT GTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
L.y CT GACC AAGCC CAGC GT C T GCTT CC AC GT C TACT GT GGT C ATT T T T AT GACATCT G C GAC GAGGAC T GCCAT GG 

Q CAG CT GCT C AG AT AC C AGC GAGT GC AC AT GC GCT C C AGGAACT GT GC T AGGC CC T G AC AGGC AGAC AT GCT T T G 

Q | ATGAAAAT GAAT GT GAGC AAAAC AAC GGT GGCTGC AGT GAGATCT GT GTGAACCTC AAAAACTCCTACCGCT GT 

SI GAGTGT GGGGT TGGCCGTGT GCT AAGAAGT GAT GGCAAGACTT GTGAAGACGTTGAAGGATGCCACAATAAC AA 

Ul T GGT GGCT GC A GCC AC TCTTGCCTT GG AT C T GAGAAAGG C T AC C AGT GT GAAT GT C C CC G GGGCCT GGT GCT GT 

! g 3 

kfi CT GAGGAT AAC CAC ACTT GCC AAGT CCCTGTGTTGT GCAAATC AAAT GCCAT TGAAGT GAACATCCCC AGGGAG 

.W* CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 

CCTCTTCTCTCT CAAGAC AT GT GGT AC AGT GGT C GAT GT GGT GAAT GAC AAGAT T GT GGC CAGC AAC C TC GT GA 
^ CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 

h;J ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 

If 

CAT GAGCC GAAAT CAT GGGATCT T CCC AT T C ACTC T GGAGATCT T C AAGGAC AAT GAGT T T GAAGAGC CT T ACC 

n ■ 

S ! J GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTT GGTGGAGAGCT GCTTT GCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 

1% CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT C AG AAGCT GGGT AT AATAT T TCAAGTT ACAAACC C T AGAAAAAT T AAACAGT TACT GAAATTATGA 
CTT AAAT AC CC AAT GAC TC CTT AAAT AT GT AAAT TAT AGT T AT ACC T T GAAAT T T CAAT T C AAAT GC AGACT AA 
T TAT AGG GAAT T T GGAAGT GTAT CAAT AAAAC AGT AT AT AATTTT 
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FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGH FYDICDEDCHGSCS DT SECTCAPGTVLGPDRQTC FDENECEQNNGGCSE ICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLECLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLECYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxy lation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



GAGAGAGGC AGC AGCTTGCT C AGCGGAC AAGGAT GCTGGGCGTGAGGG AC CAAGGC CTGCCC TGC ACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CC TGCT GCCTTGGGGT GACAATCT CAGCT C CAGGCT AC AGGG AGACCGGG AGGATC ACAG AGCC AGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GG AG ACCTTC AG AAAGGT GGGG AT CCCC AT C ATC AT AGCACT ACTG AGCC TGGCG AGT AT C ATCAT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAG C AGCTGT GT GACG GAGAGC TGGACT GT CCCT TGGGGG AGGACGAGGAGC ACTGTGTC AAGAGCTT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
M f AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
M GC T AC AGC AG AGCT GTGG AG AT TGGC CCAGACC AGG AT CT GGAT GTTG TT GAAATCAC AGAAAAC AG CCAG 

^ GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
f \ TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 

US 

iVi TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 
h) GCCC ACTGCTT C AGGAAACAT ACCGAT GT G TT C AACTG GAAGGT GCGGGC AGGC TC AGAC AAACTGGGCAG 

j 11 CT TCCC ATCCCT GG CT GT GGCCAAGATCAT CATCAT TGAATT CAACCCCATGT ACCC CAAAG AC AATG AC A 

■3 TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
£1 1 GATGAGGAGCTC AC T C CAGCC ACCCC ACT C TGGATC AT T GGATG GGGCTT T ACGAAGC AGAATGGAGGGAA 

U] GATGTCTGAC AT ACT GCT GC AGGCGT CAGT CC AGGT CAT TGACAGCAC ACGGTGC AATGCAG ACGATGCGT 

W^' AC C AGG GGGAAGTC AC CGAGAAGATGATGT GT GCAGGC ATCC CGGAAGGGGGTGTGGACACCTGC C AGGGT 

GACAGT GGTGGGCCCC T GAT GT AC CAAT CT GACCAGTGGC AT GT GGTGGGCATCGT T AGCTGGGGC TATGG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGAT CC C C CAAAGT CAGACAC AGAGC AAGAGTCCCCTTGGGT AC ACCC CTC T GCCC AC AGCCT C AGC AT 
T T CT T GGAGCAGCAAAGGGCCT C AATTCCT GT AAGAGACCCT CGC AGCCC AGAGGCGCC C AGAGGAAGTC A 
GC AGCCCT AGCT CGGC C AC ACTTGGT GCTCCC AGCATCCC AGGGAG AGAC AC AGCCCACT GAAC AAGGTCT 
CAGGGGT AT T GCTAAGCCAAGAAGGAACTT TCCC AC ACT ACTGAATGGAAGC AGGCTGT C TT GT AAAAGC C 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT AC T AGAGC AAGAAACCAGTT GTAAT AT AAAAT GCACTGCCCTACTGTTGGTAT GAC TACCGT T ACC T 
ACTGTT GT C ATT GT T ATT AC AGCT AT GGCC ACT AT TAT T AAAGAGCTGTGT AAC ATCT CT GGCAAAAAAAA 
AAAA 



ill 
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FIGURE 112 



MLQDPDSDQPLNS LDVKPLRKPRI PMET FRKVG I P 1 1 1 ALLS LAS 1 1 1 VWLI KVI LDKYY FLCG 
QPLHFI PRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRL SKDRS TLQVLDSATGNWFSAC FDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KT PRWGGEEASVDSWPWQVS I QYDKQHVCGGS I LDPHWVLTAAHC FRKHT DVFNWKVRAGSDKL 
GS FPSLAVAKI 1 1 1 E FNPMYPKDNDI ALMKLQFPLT FS GTVRP I CLPFFDEELTPAT PLWI I GWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGI VS WGYGCGGPS T PGVYTKVSAYLNW I YNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE lis 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTGTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCAGCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGAT TGA CAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
Cl AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTGCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCGAGAGCTTGAGTGAGCTGTGGCTGAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
§) GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACT TGTGCCAAAT TATGGGTCAGAAAAGATG 

yi GAGGT GT TGGGT TAT CACAAGGCAT CGAG T CT CCTGCAT TCAGT GGACAT GTGGGGGAAGGGC T G 

CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
f** t TCTGCGAT CACCAGCCAGGGGCAGC CG TC TGGGAAGGAGCAAGCAAAG TGACCATT T CTCC TCCC 

flj CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
O GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
i 11 GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 

CTCAAT TTAAATCATGT T C TAGTAAT T GGAGCTGTCCCCAAGACCAAAGGAGCTAGAGC T TGGT T 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCAC T T CAGCCTGAGCAACACAGCGAGACTCTGT CTCAGAAAAAATAAAAAAAGAATTA 
TGGT TAT TTGTAA 



o 

m 

? 3? * 



ill 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 



a 
a 
m 
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FIGURE lis 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGAC GATG GCAA 
AGAATCCTCCAGAGAAT TGTGAAGACTGT CACATTCTAAATGCAGAAGCT TTTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTAGAGCAATGGAGAGAAGAAGAAGATTTACATGGAAAT TGATCC TGTGACCAGAACTGAAATA 
T TCAGAAGCGGAAATGGCACT GAT GAAACAT TGGAAGTGCACGACT T TAAAAACGGATACAC TGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
M CT GAACCAGAAGAGGAAATAGAT GAGAAT GAAGAAAT TACCACAAC T T TC T TT GAACAGTCAGTG 

AT T T GGG TCCCAGCAGAAAAGCC TAT T GAAAACCGAGAT T T T CT T AAAAAT TCCAAAAT TC TGGA 
m GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
j|j ACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGAAAAAAAAGGGATTGAACAAAAT 
.1/1 GAACAGT GGGTGGTCCCT CAAGT GAAAG TAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 

AGAAC T T CCAATAAATGACTATAC T GAAAAT GGAATAGAATTT GAT CCCATGCT GGATGAGAGAG 

: sit 

GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 

8 

f«j GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
;J| TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAAC 
Cl TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
1 GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
':?! TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAKA.FKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites . 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CC GT GT TC C AGT ACGAAGGGCT CT GG AGGAGCTGC G TGAGGC AGAGT T C AGGCT TC ACC G AATGCAGGCC C 
T ATT T C ACCATC C T GGGACT TCCAGC CAT GCT GC AG GCAGT G CGAGCCCT GAT GAT CGT AGGCATCGT CC T 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCAC ACT AAT TGGGGGTGTGATGATGT G C ATCGCCT GCCGGGGC CT GGC ACG AG AAGAAACCAACT ACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
T GGGTC CAACAC C AAAAACAAGAAGAT AT ACGATGG AGGT GCCC GCAC AGAGGACGAGGT ACAAT C TT ATC 
C TTCCAAGC ACGACT ATGT G TAATGCT C T AAGACCT CT CAGC AC GGGCGGAAGAAACTCC CGGAGAGCTC A 
C CCAAAAAACAAGGAG ATCCCATCTAGAT T TCTTCT TG CTTT TG ACTC AC AGCT GGAAGT TAGAAAAG CC T 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTG AGTT AT TT AT GAAT TAGAGGCT AT AGCT CAC ATT TT C AAT CCTCT ATTT C TT T TTT TAAATATAACT 
T TCT AC TCT GAT GAGAGAATGT GGTT TT AATCTCT CTC TCAC AT T TTGAT GAT T TAGACAGACTCCCC CT C 
T TC CTCCT AGTC AAT AAAC C CATT GATGAT CT ATTT CC C AGCTT ATCCCC AAGAAAACT T TT GAAAGG AAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
AC ACTGTGAT CT T AAAAGT T ACCAAACC AAAGTCAT TT T C AGTT T G AGGC AACC AAACCTTT CT ACT GCT G 
T TGACATCT T CT T ATT AC AG CAACAC CAT T C T AGGAGT T T CCTGAGCT CTCC ACTGGAGT CCT CTTT C TGT 
CGC GGGTC AGAAAT TGT CC C TAGATGAAT GAGAAAATT ATTT TT T TTAAT TTAAGT CCT AAAT ATAGT TAA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAZIATAGCCTCACCCCTACATGTGGATAGAAG 
GAAATGAAAAAAT AATTGCT TT GACATT GT CT ATAT GGT ACT TT GT AAAGTCAT GC T TAAGT AC AAAT TC C 
ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 
TCGAGACT AGCC TGGGCAAC AT GGAGAAGC C C TGTC TCT ACAAAAT ACAGAGAGAAAAAATC AGCCAGT C A 
TGGT GGCATACACCTGT AGT CCCAGC ATT C CG GGAGGCT G AGGT GGGAGGATCACTTGAGCCCAGGGAGGT 
TGGG GCTGCAGT GAGCCATGATCACACCAC TG CACT CC AGCC AGGT GACATAGC GAGATCCT GTCT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 

MS T T TCQWAFLLS I LGLAGC I AAT GMDMWS TQDL YDNPVT S VFQYE GLWRS C VRQS S GFTECRP 
YFTILGLP^LQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAIAGVSVFANMLVTNFWMSTANMY^^ 

CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATA ATGG CAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CAT C ACG GGCAT GGTGGTGCT CAT C CC T G T GAGCT G GGT T GC CAAT GC CAT CAT C AGAGAT T T C TATA 

\svh 

**** ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
f| CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
0] CAGAT ACTCGAT ACCT TCC CATCGCACAACCCAAAAAAGTTAT GACACCGGAAAGAAGTCACCGAGCG 

U| TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
^ AGA?^AAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
%l AAT C TTAAT TACAGGAACTGTGCATCAGCTAT TTAT GAT T CTATAAGCTATTT CAGCAGAATGAGAT A 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
f*$ CTC TTTTTATCAT TTACTTCAAAATGACATTGCTAAAGACTGCAT TAT T TTACTACT GTAATT TCTCC 

[|] ACGACATAGCATTATGTACATAGATGAGTGTAACAT T TATATC TCACATAGAGACAT GCTTAT AT GGT 

Q TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
Fli ATCATGGATAGGGTTGAAGAAGGT TAC TATTAATTGTTTAAAAACAGCT TAGGGAT TAATGTCCT CCA 

™ TTTATAATGAAGATTAAAATGAAGGCT TTAAT CAGCATTGTAAAGGAAATTGAATGGCTTTCTGATAT 

m 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT T TTTGT TTTT GTATTT GAAGAAGAATGAT GCATT TT GACAAGAAATCATATATGTATGGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT AC AT T TAT AT T AAT AAAT T GT ACAT T T T T C TAAT T 
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FIGURE 120 



MATHALEIAGLFLGGV(^GTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQi^IRMQCK 
IYDSLLALSPDLQAARGLMCAASVMSFI^ 

MWLIPVSWANAI IRDFYNS IVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYS I PSHRTTQKS YHTGKKS PSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAAT GT CT GAGGGAAAGCT T TGAGGAGT CCT GGACACCCAAC TAC 
AAGCAGT GTTCATGGAGT TCAT TGAATTAT GGCATAGAT CT TGGGAAAAT T GCGGAGTGTACAT T 
T ACAAAGATGCGTT CAAATAGT GCTC TAAGAGT TT T GTTCAGT GGC T CAC T TCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATT GAAGC TATAAT T TAT T TGGACCAAGGAAGCCCTGAAATGAAT TCAACAAT TAATAT TCATCG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
AT TAT T G AAG AAC TAC C AAAATAAAT GC T T T AAT T T T CAT T T GC T AC CTCTTTTTT TAT TAT GC C 
TTGGAAT GGTT CACT TAAATGACAT T T TAAATAAGT T TATGTATACAT CT GAAT GAAAAGCAAAG 
CT AAATATGTT TACAGACCAAAGTGT GAT TTCACAC T GT T T TT AAAT C TAGCAT TAT TCAT T T TG 
CT TCAAT CAAAAGTGGT T TCAATAT T T TT TT TAGT TGGT TAGAATACT TTCT TCATAGT CACATT 
CTCT CAACCTATAAT T TGGAATATT GT TG TGGTC T TTTGTTTTTTCTCT T AGTATAGCAT T T T TA 
AAAAAAT ATAAAAGC TAC CAATCTT TG TACAAT T TGTAAAT GT TAAGAAT T T TT TT TATATC TGT 
TAAAT AAAAAT TAT T TCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQREWDL YNGMCLQGPAGVPGR 
DGS PGANVI PGT PG I PGRDGFKGEKGECLRE S FEE S WTPNYKQC S WS SLNYG I DLGKIAECT FTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVE GLCEG I GAGLVDYAI WVGT CS D Y PKGDAS TGWNS VS R 1 1 IEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 

GCTGAGCGTGTGCGCGGTACGGGGCTCTGCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCCT GGAAGAATACATCATG TT T T T CGATAAGAAGAAATT G T AGGAT CCAGT TT T T TT T T T A 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCC T AAGC CAAAGCAAAAGACCTAAGGACGACCT T TGAACAATACAAAGGATGGGT T TCAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
, , ATCTCAGAAAT TACAGGAGATACCC T CAAGTATATC TGCTGGT T GCT T AGG T T T GTCC CT TCGCT 

M ? ATAACAGCC TT CAAAAAC T TAAGT AT AAT CAAT T TAAAGGGCTCAACCAGC TCAC CTGGC TATAC 

CJ C TT GACCATAACC ATATCAGCAAT AT TGACGAAAAT GC T T T TAATGGAATACGCAGAC TCAAAGA 

Q GCT GATT C T TAG TT CCAATAGAATC T C CT AT T T TC T TAACAATACCT T CAGACCTGTGACAAAT T 

%\ TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
fl CGGAAGC T GCTGAGT T TACAT T TACGG TCTAAC TGCC TGAGAACCAT CCC T GT GCGAATAT T CCA 

l 'M AGAC T GCCGCAACC TGGAACT T T TGGACC T GGGATATAACCGGAT CCGAAGT T TAGCCAGGAAT G 

111 TCTT T GCT GGCATGATCAGACTCAAAGAAC T TCACCT GGAGCACAAT CAAT T TTCCAAGCT CAAC 

ffl CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
\l% CATAGGACAGACCATGTCC TGGACCT GGAGCTCC T TACAAAGGC T T GATT TAT CAGGC AAT GAGA 

TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
Q T CT T GCTGGGAATATATGGGAAT GCAGCAGAAATAT T TGCT CCC T T GT AAAC TGGC TGAAAAGT T 

01 T TAAAGGT C TAAGGGAGAATACAAT T AT CT GT GC CAGTCCCAAAGAGCTGCAAGGAGTAAATGT G 

%S ATCGATGCAGTGAAGAAC TACAGCATCTGTGGCAAAAGTACTACAGAGAGGT TTGATCTGGCCAG 

j!^ GGCTCTCCCAAAGCCGACGTTT7UVGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
f M TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
CI TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
fy CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAA?^GACAGTCCCTAAAGC?\AATGACTCCCAGCACCCAGGAAT TT TATGTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGATGC TGC TGAATGGGACGGGACCC T GCACCTA 
TAACAAAT C GGGCTCCAGGGAGT GT GAGGTATGAACCAT TGTGATAAAAAGAGC T C T TAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATT TT CC TCT CATACATAATCAACC CAT TGAAAT T TAAATACCACAAT CAATGTGAAGC T T 
GAACT CCGGTT TAATATAATACC TAT T GT ATAAGACCC T T TACT GAT T CCAT TAATGTCGCAT T T 
GTT T TAAGATAAAACT TCTT TCATAGGTAAAAAAAAAAA 
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FIGURE 12a 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMI RLKELHLEHNQ FSKLNLAL FPRLVS LQNL YLQWNKI S VI GQTMS WTWS S LQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS I CGKS TTERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

! ;ff Important features of the protein: 

01 

: Signal peptide: 

m 

\f\ amino acids 1-33 

<?g» ■ 

; § >i 

w?i Transmembrane domain: 

amino acids 420-442 

ill 

.f J N-glycosylation sites. 

flj amino acids 126-129, 357-360, 496-499, 504-507 

rs? 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12s 

CCGTTATCGTCTTGCGCTACTGCTGA ATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGC TT TC TAAAGC T TT GGCAAGGAGTGACACCCGC CAT T TACAGACACG T AG T 
GT AT TCT GGAGGTCGAAT GGT CACATAT GAACATCTCCGAGAGGT T GTGT T TGGCAAAAGT GAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
N> TTAGCCAAT CCAACTGACC TAGTGAAGG T T CAGAT GCAAATGGAAGGAAAAAGGAAAC TGGAAGG 

0 AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
5*( GAGGGCT TTGGGCAGGCT GGGTACCCAAT AT ACAAAGAGCAGCACT GGTGAATAT GGGAGAT T TA 

T.\ ACCACT TATGATACAGTGAAACACT ACT T GGTAT TGAATACACCAC T T GAGGACAATAT CATGAC 

jj| TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
fj s T CAAAAGCAGAATAAT GAATCAACCACGAGATAAACAAGGAAGGGGACT T T TG TATAAAT CATCG 

yi ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 

AT CT TGGCTGAGAATGACCCC T TGG T CAAT GGTGT TC TGGC TTACT TATGAAAAAAT CAGAGAGA 
TK TGAGTGGAGTCAGTCCATTTTAA 

m 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGKVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRNTVTYEHLREVVFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGK^ 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCC TCTCCAT GT T CTCAGCTATCC GT T C T CAGCACAGCGGTG TAGACAT C 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACAT CAT TAACAT CAATAGCATGTCT GGCCACC GAGT GT TA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCC T TCAAACT CCACGACAAGGACCC TGAGAAGGCAGCT GCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGAT T GGAGACAT CCAGAT GAGGCCCACGGAGCAGGTGACC TAGT GAC TGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACT TCCTGTCCACACCCCGACCAGGGGCTAGAAAAT TTGT TTGAGATT TT TATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCT GGCCCAGT GGAT T T CATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRLALVTGASGGI 

PYRCDLSNEEDILSMFSAIRSQHSGVDICIlSntfAGLA^^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PGWETQ FAFKLHDKDPEKAAAT YEQMKCLKPEDVAEAVI YVLS T PAHI QI GD IQMRPTEQVT 



Important features of* the protein: 
Signal peptide: 

amino acids 1-17 

o 

**** N-myristoylation sites. 

f\ amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 

ill 199-205 

Ml Short-chain alcohol dehyrogenase. 

*L amino acids 30-42, 104-114 

ill 



, it* 
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FIGURE 12Q 

AACTTCTAC ATG GGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGGTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAA ATGAG GAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATTT TTAACTGATAGT TGTACATATT TGGGGGTACATGTGATAT TTGGATACATGTATACAA 
TAT ATAATGATCAAATCAGGGTAACTGGGATAT CCATCACAT CAAACAT T TAT T T T T TAT TC T T T 
T TAGACAGAGT C TCAC TC TGT CACC CAGGCT GGAGT GCAG TGGT GCCATC T CAGC TTACT GCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTGATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 
GCAAAAA 
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FIGURE lao 

MGLLLLVLFLSLLPVAYT IMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGES PMP FKRVFCQDGNVRS FCVCAVHFS SHQPP VAVE CLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 



TT CTGAAGT AAC GG AAGCT ACCTT GT AT AAAG ACCTC AAC ACT GCTGACC ATGA T C AGCGC AGCCTGGAGC 
AT C TT C CTCATCGGGACT AAAAT T GGGCTGTTCCT T CAAGT AGCACCT CT AT CAGT TAT GGC TAAAT CCT G 
TC CAT C TGTGTGTCGCTGCGATGC GG GTTT CATTT ACT GT AATGAT CG CTTT CT GAC ATCC AT T CC AACAG 
GAAT AC C AGAGGAT GCTACAACTCTC TACC TT CAGAAC AACC AAAT AAAT AAT GCT GGGATT CC TTC AGAT 
TT GAAAAACT TGCT GAAAGT AGAAAG AAT AT ACCTATACC AC AACAGT T TAG AT GAAT T TCCT ACCAACCT 
CC CAAAGT AT GT AAAAGAGT TACATT T GCAAG AAAATAAC AT AAGGACT ATC AC TT AT GATT C ACTTT C AA 
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GC CCAGGACT AT AG AAGAACT ACGCTTGG ATGAT AATC GC AT AT CC AC T ATT TC AT C ACC AT CT CTT C AAG 
GT CTCACT AGTC T AAAACGC CTGGTT CT AG AT GGAAAC C TGT TGAACAAT CAT GGTT T AGGT GACAAAGT T 
T T CT T C AACCT AGT T AAT TT GACAGAGCTGTC CCT GGT GCGGAATTCC CT GACT GCTGCACC AGTAAACC T 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GATG AT T TGG AC AAT AT AAC AC AACT GATT CT T CGC AAC AAT CC CT GGT ATTGC GGGTGC AAGAT G AAAT G 
GGT ACGT GAC TGGT T ACAAT CACT ACC TGT GAAGGT CAACGT GC GTGGGCTC AT GT GCC AAGCC CC AGAAA 
AGGTTC GT GGGATGGC TAT T AAGGAT CT CAAT GC AG AACT GT TT GATT GT AAGG AC AGTGGGAT TGTAAGC 
ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 
CAAACAGCC AGATAT TAAGAAC CCCAAGCT CACT AAGGAT CAACAAACCACAGGGAGTCCCT C AAG AAAAA 
CAATTACAAT TACT GT GAAGTCTGT C ACCT CT GAT ACC ATTCATATCT CT TGGAAACT T GCT CT ACCT AT G 
AC TGCT TTGAGACT C AGCT GGCT T AAACTGGGCC AT AG C C CGGC ATT T GGATCT AT AACAGAAACAAT TGT 
AACAGG GGAACGC AGT GAGT ACT T GGTC AC AGCCCT GG AGCCTGATTCAC CCT ATAAAGT AT GC AT GGT T C 
CC AT GGAAAC C AGC AACCTCTACC T ATT TGAT GAAACT CCTGTT TGT ATT GAGACT GAAACT GC ACCCCT T 
CGAATGTACAACCCT ACAACC ACC C T CAATCGAGAGCAAG AGAAAG AACC T T AC AAAAACCCC AAT TT AC C 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCAT AGG AATGGAT C GCTCTT CT CAAGGAACT GT GCAT AT AGC AAAGGGAGGAGAAGAAAGGATGACT AT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 
AAGC AAT GAACCC ATCTC GAAGGAGGAGTT TGT AAT AC AC ACC ATATT TCCTCC TAATGGAATGAATC TGT 
ACAAAAAC AATC AC AGTGAAAGC AGT AGT AACCG AAGCT AC AGAGACAGT GGT ATT CCAGACTC AGAT CAC 
TC AC AC TC ATGATGCT GAAGGACT CAC AGC AGAC TT GT GTTT T GGGTT TT TT AAACCT AAGGGAGGTG AT G 
GT 
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FIGURE 122 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNS VSAVS I EEGAFRDSNYLRLLFLSRNHL S TI PWGLPRT I EELRLDDNRI S T I S S PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKTWRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAI KDLNAEL FDCKDSG I VS T I Q I TTAI PNTVY PAQGQWPAP VTKQP D 
IKNPKLTKDQQTTGSPSRKTIT I TVKSVTSDT IHI SWKLALPMTALRLSWLKLGHSPAFGS I TET 
I VTGERSE YLVTALE PDS P YKVCMVPMET SNLYLFDE TP VC I E TETAPLRMYNPT TTLNREQEKE 
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide : 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 



AppJD=10063565 



Page 284 



FIGURE iaa 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
T GCAGAGGCAGTCT GGGCT TGGCCAGAGCTCAGGGTGC TGAGC GT GT GACCAGCAGT GAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT T CGGGGAGCCT GATGC TGAAGATGAAGAATTAT CTAAAGC TAT TCAA 
TATCAGCAGCATTTT TCGAGGAGAGTGAAGAGGCGAGAAAAACAATT TCCAGAT TGTCGCTCTGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCAC T T T CAAAGAGACT G TGT TGAATAAAGGGCCAAGGT TCT T GCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
C CACCACCAAGCAGCCGC TGAGACGGACGGTT CCATGCCAGCT GCCT GGAGGAGGAACAGACCCC 
T T TAGTC CT CAT CCC TTAGATCC TGGAGGGCACGGAT CACATCC TGGGAAGAAGGCAT C TGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAAGTGAAAAAAAAAAA 
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FIGURE 134 

MSi^GRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T T ACAGAT T TGATCCCGTT CGAGTGGATATCAC T TCGAAAGGAAAAATGAGAGCAAGATAT G 
T GAAT TACAT CAAAACAT CAGAGGT T GT CAGAC TGCCCTATCCT CT CCAAATGAAATC T T CAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGT CAATGAATATGC T GAAT TCCAACCATGAGT T GCC T GAT 
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAAGAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACAGT GGGTGGCATC CAAGT CT TGGAAAACCGT GT GAAGCAACTACTATAAAC TT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAAT AT TGATGTCAC TGAAT TAAT T ACAGTGT CC TATAGAAAAT GCCAT TAATAAAT TAT 
AT GAAC T AC TATACAT T AT G TAT AT T AAT T AAAACAT C T T AAT C CAGAAAT C AAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 126 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEWSPAYRFDPVRVDITSKGKMRARYWyiKTSE 
VTOLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMViyQylMVLPLLIFVL^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



m 

31 
111 

ill 

ry 



Important features of the protein: 
Signal sequence : 

amino acids 1-23 



O Transmembrane domain: 

hV- amino acids 161-182 



N-glycosylation site, 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
Ui AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
f'\ TGC7VAGCCAAAGATGGAAACATTGACATCAGAATCTT7VAGGAGGACTGAGTCTTTGCAAGACACA 
Cf AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 

£m AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 

id 

CCAT C AAGAAGGACC TCCGGC T C TC TC AT GCCCACATGACATGCCAT T GT GGGGAGGAAGCAAT G 



U! 



AAGAAATAC AGCCAGAT TC TGAGT CAC T T TGAAAAGCT GGAACCTCAGGCAGCAGT T GT GAAGGC 



[J] T T TGGGGGAAC TAGACAT T C T TCTGCAAT GGAT GGAGGAGACAGAATAGGAGGAAAGT GAT GCT G 

g C TGCT AAGAATATTCGAGGTCAAGAGCTC CAGT CT TCAATACC T GCAGAGGAGGCATGACCCCAA 

ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 

ri 

wi T GTAATATC T T T CTGC TAT TGGATATAT T TAT TAGT TAATATAT TTAT T T AT TT T T TGC TAT T T A 

\ U 

Q ATGTATT TATTT TT TTAC TTGGACATGAAACTTTAAAAAAATTCACAGATTATAT TTATAACCTG 

til ACTAGAGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGG 

CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATT TGAAAT T GAACCAATGACTACT T AGGATGGGT T GT GGAATAAGT T T T GATGT GGAATT GCAC 
ATC TACCT TACAAT TACTGACCAT CCCCAGTAGAC T CCCCAGT CCCATAAT TGTGTATC TT CCAG 
CCAGGAAT CCTACACGGCCAGCAT GTAT T T CTACAAATAAAGT T TTCT T TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFY 

FSE I RGS VQAKDGNI D I RI LRRTES LQDTKPANRCCLLRHLLRLYLDRVFKNYQT PDHYTLRKI S 

SLANSFLTIKKDLRLSHAHMTCHCGE 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGT CAAAGGG T GTGAAAT T TAT GCCCTACACAACT TACC T T GT GGAAAAAGGAGCAT C T CACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC TGGAAAAAGAAT AGTC T T TTATGGAGAT GAAAC CT GGGT TAAAT TA 
T TCCCAAAGCAT TT TGTGGAATATGATGGAACAACCTCATTTT TCGT GTCAGAT TACACAGAGGT 
GGATAATAAT GTCACGAGGCAT T T GGATAAAGTAT TAAAAAGAGGAGATT GGGACATAT TAATCC 
U t , T CCACTACC T GGGGC TGGACCACAT T GGC CACATT TCAGGGCCCAACAGCC CCC TGAT T GGGCAG 

%l AAGC T GAGCGAGATGGACAGCGT GC T GAT GAAGATCCACACCT CACTGCAGT CGAAGGAGAGAGA 

y GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
0 GGGCCTCCTCCACCGAGGAGGTGAATACACCTC TGAT TT TAAT CAGTTCT GCGT TTGAAAGGAAA 

tfl CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
fl \ ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
?Z CAAT GAGAGAGCAGTTGAGATTTTTACAT TTGAATACAGTGCAGCT TAGTAAAC TGTTGCAAGAG 

l £l AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
ffl GAACTGGATCAGACTGTAC TT GGAGGAAAAGCAT T CAGAAGTC C TAT T CAACCT GGGC T CCAAGG 

Ul TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
I ' TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
% % CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
h % CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 

GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
CI AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
f\\ TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 

CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
Til CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCAGTCATCCTGCCACCCCCAGAATGCATCCT 

GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACT C AGAT C CACAGAGCC CAGGATC AAG G GAC C CAC T GCAG T G GC AG CAG GAC T G T TG GGC C CC C 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C TGAC CGAGACAC TCACAGC T T T GT CAT CAGGGCACAGGC T TCC TCGGAGC CAGGATGATC TGT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGT TACCAAAAGAATAAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKMPYTT^^ 

VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMS ETGSHGAS S TEE VNT PL I L I S SAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAZVA.CAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTT GGGGT GACAGATAAAT T TGAGAACAGGAAACACAT TGAAT T TT CATT TCAACCAGT T TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 

MLLLLLE YNFP I ENNCQHLKTTHT FRVKNLNPKKFS I HDQDHKVLVLDS GNL I AVPDKNYI RPE I 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

r~% amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT T T GCTGTCCACACAT T CAACCAACAGAGCAAGGAC TAC TAT GC C TACAGAC TGGGG 
CACAT CT T GAAT TCC T GGAAGGAGCAGGT GGAGTCCAAGAC TGTATT C TCAATGGAGCTAC TGCT 
GGGGAGAACTAGGT G T GGGAAAT T T GAAGACGACAT T GACAACTGCCAT T T C CAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCT T GAGCC TAATCAT GTAG T GTAGATCAT TAAACATCAGCAT T T T AAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121,, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE i4fl 

CTGTGCAGCTCGA6GCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACT T CGGAGCCGAGGGCAACAGGT ACTACGAGGCCAACTACT GGCAGTT CCCCGAT GGCAT CCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
L . CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
m TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
Q AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CH CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
H! TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 

m 

^ CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 

if! GAGAAGAACACATCAGGCACTGCGCCACCT GCT TCACAGTACT TCCCAACAACTCTTAGAGGTAG 

% GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 

U ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 

^| GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 

i. j 

Sj CT CTAAT GAAATTGTGAAAGC T CCAT G TT TAGAAATAAATGAAAACACC TGA 

i ^ 
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FIGURE 146 

MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANFQGKFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLIjyLIWLMVK 

Important features of* the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
C CT GAT GATT TATAGAC T CAAAGAAAAC TCATGT TCAGAAGC TCTCT T CT CT TC TGGC CTCC TC T 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGGGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGT T GATAACTT GAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAAT T GAGAAAGAAAGACAAT CTA 
M TAAGAAGCTCCCCACTT GATAATAAGT TGAATGTGGAAGAT GTTGATT CAACCAAGAATCGAAAACTG 

Q ATCGATGATTATGACTCTACTAAGAGTGGATT GGATCATAAATTTCAAGATGATCCAGATGGTCT TCA 

Q TCAACTAGACGGGACT CCT TTAACCGCTGAAGACAT T GTCCATAAAAT CGCT GCCAGGATTTAT GAAG 

f \ AAAATGACAGAGCCGT GTT TGACAAGATT GTT TCTAAAC TACT TAATCTCGGCCTTATCACAGAAAGC 

fit CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 
rn TTATGAGGAGGATCCCAATAAGCCCACAAGCT GGAC T GAGAATCAGGCTGGAAAAATACCAGAGAAAG 

[|| TGACTCCAATGGCAGCAATT CAAGATGGT CTT GCTAAGGGAGAAAACGATGAAACAGTAT CTAACACA 

is TTAACCT TGACAAATGGCTT GGAAAGGAGAACTAAAACCTACAGT GAAGACAACTT TGAGGAACT CCA 

Q ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 

CACTGATTACTATCAT GAAAACAC TGATT GACT TT GTGAAGATGATGGTGAAATAT GGAACAATATCT 
2t CCAGAAGAAGGTGTTT CCTACCT TGAAAACTTGGAT GAAAT GATT GCT CTT CAGAC CAAAAACAAGCT 

: a J 

K AGAAAAAAATGCTACT GACAATATAAGCAAGCT TTT CCCAGCACCAT CAGAGAAGAGT CAT GAAG AAA 

m CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 

GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTT CTAAT TCTGT GAT TAAAATTTT TTGACCCAAGG 
GTTATTAGAAAGT GCTGAAT TTACAGTAGTTAACC T TTTACAAGT GGTTAAAACATAGCTTTCTT CCC 
GTAAAAACTAT CT GAAAGTAAAGT TGTAT GTAAGCT GAAAAAAAAAAAAAAAAAAAA 
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FIGURE iso 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDY.DSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETL I T IMKTLI DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 
amino acids 143-148, 239-244 
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FIGURE 151 



CGGC T CGAGGC TCCCGCGAGGAGAAAGGAACAT T CTGAGGGGAG TCTACACC CT G TGGAGC T CAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAA?\AACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCT GAGCCCCGCAGGCCAACCCAT CCCCAGT TGAGCCT TATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAAT GGCTCGAGC TCAGAAGATAAAAGAT AAGTAGGGTAT GCTGAT CCT CT TT TAAAAACC CAA. 
GATACAATCAAAATCCCAGATGCTGGTCT CTAT TCCCAT GAAAAAGT GCT CATGACATATTGAGA 
AGACC TAC T TACAAAGTGGCAT ATAT T GCAAT T TAT T T TAATT AAAAGATACCTAT TTATATAT T 
TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
ATT CC TT GT ATAAAAATAAGAAAAGAAAT TAATC TTGAGGTAAGCAGAGCAGACATCATCTCTGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T TGTAGT AGTGATCAGGAAACAGAT CT CAGCAAAGCCACTGAGGAGGAGGCT GT GC TGAGT T TGT 
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GAT CACAGC CC C TGGGAT T CCAAGGCATT GGATCCAGT C TC TAAGAAGGC TGCT GTAC TGGT TGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGC TGGATGAAGGTAGACCTAAAT TCAATAT 
GAC TGGT T T CCT TGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATGTA 
AAGATGAAGGCAGAGATCGGAGT TT TGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC 
ATCAGAAGCT TGGAAGAGGCAAAGAAGAAT TC T T CCC TAGAGGC T TTAGAGGGATAACGGCTC TG 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT T GGT TACAGCAGCT CTAGGAAACTAATACAGCT GCTAAAATGATCCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGAT GGCAT GCCACTT CCAAGAT TAGGTTAT AAAAGACAC TGCAGCT T C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC T AGCT GCCATGC TAT GAGCAGGCCTATAAAGAGAC T TAC G T GGTAAAAAATGAAGTC T CCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGTTGCTCAGT TTTGGTC TAACTTGTTAT GCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISVVPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLT QLPENGGWNAP I TDFYFQQCD 

N-myristoylation sites . 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

H amino acids 111-131 

•CI ' 

ttl Interleukin-1 proteins. 

| p| amino acids 8-29, 83-120, 95-134, 64-103 

i ■ H . . 

m . ■ 

■ S3 



ill 
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FIGURE 152 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT 
GCCT GCAT T TGA CCAGAGCAAAGCT GAAAAAT GAATAAC TAAC CCCC TT TCCC TGCTAGAAATAA 
CAAT TAGATGCCCCAAAGCGATT TT TTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACTTTCTAAGCATAGATAT TTATTGATAACATT TCAT TGTAAC TGGTGTT C 
TATACACAGAAAACAAT T TATT T TT TAAATAAT TGTC T TT TTCCATAAAAAAGAT TACT TT CCAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
TGTATTTAT TAT TAT T AT AAGAC TGC AT T T TAT T TAT AT CAT T T TAT T AAT AT GGAT T TATT TAT 
AGAAACAT CAT T CGATAT T GCTACT TGAGT GTAAGGCTAATAT T GAT AT T T ATGACAATAAT TAT 
AGAGC TATAACATGT T TATT TGACCT CAAT AAACAC TTGGAT AT CCC 
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FIGURE 154 

MAALQKSVS S FLMGTLAT S CLLLLALLVQGGAAAP I S S HCRLDKSNFQQPY I TNRT FMLAKEAS L 
ADNNTDVRLIGEKLFHGVSMSSRCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE iss 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTG^ 
CAG T CAG T GCCC GAC T TGT GAC T GAGT GT GCAG T GCCCAGCATGTACCAGGT CAG TGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATG'CTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAGC CGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACT TT C TAGATAT T TCCCCC T TGC TGGAGAAGAAAGAGCCCC TGG T TT TAT T 
TGTTTGTT TACT CAT CACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACATGGAT GAT TC TGAGGAGGAAGCTGTTAT TGAATGTATAGAGAT T T ATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 1*6 

MRERPRLGEDSSLISLFLQWAFLflMVMGTHTYSHWPSCCPSKGQDTSEELLRWSWPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 



m 



Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 



□ 

fl amino acids 136-140 



Tyrosine kinase phosphorylation site. 

amino acids 127-135 

y i 

e N-myristoylation sites. 

p amino acids 44-50, 150-156 

Hi 



Hi 



rij 
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FIGURE 1*7 



CCGGCGATCTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGAC TT GAGGGAC CTCCGAGTAGAACCT GT TACAAC TAGTGT T GCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT T TGT GTGACGGGCAAAAGCAACTTCCAGT CCTACAGC T GT GT GAGGTGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
C C C T T CC AT G T C T GT GAAT T T C ACC T C AC C AGGC T GCC TAGAC CACAT AAT G AAAT AT AAAAAAA 
AGTGT GT CAAGGCCGGAAGCCT GT GGGAT CCGAACAT CAC TGCT TGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
T CACACAAT TT GT TAC T TCACTGAATT TCTT CAAAACCAT T GCAGAAG TGAGGT CATCC TTGAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGAT CTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACT TTAGAGAGAT TGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGP.SPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASI RLLKATKICVTGKSN FQSYSCVRCNYTEAFQTQTRPSGGKWTFSY IGFPVELNTVY FIGAHNI P 
NANMNEDGPSMSVNFTSPGCLDHIMKYKBCKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
ST I IGFSQVFEPHQKKQTRASWI PVTGDSEGATVQLT PYFPTCGSDC IRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATOTLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLWYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQW^TQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNL FCSDLRSQI HLHKYVW Y FRE IDTKDDYNAL S VCP KYHLMKDATAFC AELLHVKQQVSAGKRSQACHD 
GCCSL 

M Important features of the protein: 

hJ Signal peptide: 

5 * B * amino acids 1-14 

■31 
hi 

Transmembrane domain: 

■■■■■ - 

?n t amino acids 290-309 

ffl 

M N-glycosylation sites. 

Q amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 

ill - 287 

kj 

ill cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

W amino acids 228 - 232 and 319 - 323 

rii 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACTCAGCT GAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGC TGGAAGTAGGAGGAGAGTCAGGAC TCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCC TCCGCCC CC T C TG GAGGCTGAAGAGGGAT T C 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGATGAGGAAAAGT T T GGAGGAGCAGC TGACTCAGGGGT GGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCGAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
fa TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 

G GAAC T C AACC AC ACAC AGC AG CTGCCTGCCCTGCCCTGGCT C AACG T G T CAGC AGAT G G T GACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGGAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
t| GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
0! CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
hj CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
Hi CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 

GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCGCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
Lfl AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
B CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 

GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
4] CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
C| CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
f|j CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
W 'CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 

CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGGAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGG'TGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAA TAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



til 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDBEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVWDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDWHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPiyTOKYIHKRWALW 

RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 



fT! Transmembrane domain. 

■sr r 

ill amino acids 453-475 

?*\ 

W N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosarainoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCAC CAAGAT GACTGACAGGT T CAGC TCT CTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
TTCTACCACTTAGAGCTCCAGGTCAACCGCACCTACCAAATGCACCTTGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
U TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
'& s CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
fj. TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
01 T CCACAGCGGCATAGCCTGTCCGAGAT CACCTAC TTAGGGCAGCCAGACATC TCCATCC TCCAGC 

L|| CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
fn GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
%l GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
& CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
111 CACCT TAGGCC TAAAGGTCAGC T T CAGAAAGAGCCACCAGC TGGAAGC TGCATGT TAGGTGGCCT 

g TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
f-i. CCC T GGGGAT T TGCACAGACAGAACATCT GACCCAAATGTGCTACACAGT GGGGAGGAAGGGACA 

rjj CCACAGTACCTAAAGGGCCAGCTCCCCCTCGTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
Q TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
flj . GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
pi AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
SjJ ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
Hi AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAAT GACAGTGCAAGGAGGAAAT GCAGGGAAACTC C CGAGGT CCAGAGCCCCACC T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGT CCACCAGGGAGGGAGACACACAAGCC T TT TCTGCAGGCAGGAGT T TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CT CACCACT CACAAGC TGT GTGACT T CAAACAAATGAAATCAGT GCCCAGAACCTCGGT TTCC TC 
AT CT GTAAT GT GGGGATCATAACACC TACCTCAT GGAGT TGTGGTGAAGAT GAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
E FLGT IMI CVP TWAKESAPYMCRVKTLPDRTWT YS FS GAFLFSMGFLVAVLC YLS YRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQAT PDS WPP S YGVCMEGS GKDSPTGTLSS PKHLRPKGQLQKE PPAGS CMLGGL S LQE VT S LAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycdsylation sites. 

amino acids 80-83> 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACA^^ 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTT GAACACAGT TT TGGATAT T T TCCAAAAGAT T T GAT CAAGG TACT T CATAAATACACGGA 
AGAAGAGCTACATAT TCCAGCAGATGAGACAGACT TTGTCTGCTTTGAAGGAGGAAGAGATGAT T 
T TAATAGT T ATAATGTAGAAGAGC T T TT AGGAT CT T TGGAAC T GGAGGAC TC TGTACC TGAAGAG 
T CGAAGAAAGCTGAAGAAGT TT CT CAGCACAGAGAGAAATCT CC TGAGGAGT C T CGGGGGCGT GA 
ACT T GACC CT G TGC C TGAGCCCGAGGCAT T CAGAGC TGAT TCAGAGGAT GGAGAAGGT GC TT TCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCT TACAAAGTCCT GAAAACAGAAATGAGTCAGAGAGGAAGT GGACAGTGCGT TAT T 
CAT TACAGCAAAGGATTT CGTT GGCATCAAAATCTAAGTT TGT T TTACAAAGAT TGTTT T TAG TA 
CTAAGCT GCCT TGGCAGT T TGCAT T T TT GAGCCAAACAAAAATATAT TAT TTTCCCTTC TAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHS FGYFPKDLIKVLHKYTEEELHI PADETDFVC FEGGRD 
DFNSY3STVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKT EMS QRGS GQCVI HYS KG FRWHQNLS L FYKDC F 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 



yl N-glycosylation site. 

Ul amino acids 294-298 



cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 30-34 



'■ 11 

'iff Tyrosine kinase phosphorylation site. 

r\ 

amino acids 67-7 6 

fll 

fjf N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32 
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FIGURE 167 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
L.i GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
h ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
■£ s TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
gj ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
j" 4 , CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAA.GCACAATTATATTTG 
CAAGTATGAACCAGAGATTAATCGAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
^ GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
it; ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
• GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
r| CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
jjl GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
«J TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
:«5 ACTTCAAACTTCAA.GCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
•fjj- TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAA.GACTAA.TGCAGTCAATTTGAACAAAA.GAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGC'ECTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAA.GGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAflAAA 

AAA?yyyyywy\AAA?ywywyyvAAAAAAAAAAA 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFK1HPCYKMAYFHELSSRV FQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YR3STWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKH 

PYLTNQPGDTHQNWVTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
S-TLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 



j-lj N-glycosylation sites. 

8 amino acids 86-89, 255-258 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-2 69 

N-myristoylation sites. 

amino acids 27-32/ 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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